FIGURE 133

TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG
TGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC 
TGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAA 
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 
FIGURE 134

CACACTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCAAGGGTCNGGAAAACAGCAACAA 
GCTGAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTTGGGTGAGGAC 
CCAACTGGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCG 
CTTCGGCTGAAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAG 
TTGACCTACCCCTTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTG 
CAGGCTGTTTTCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGG 
AATGTGAATCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTT 
GGTTGCCAGAATCAGCTGCCATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCC 
AAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACT 

CCGC 
FIGURE 135

GCGAGGTGGCGATCGCTGAGAGGCAC3GAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 

GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 

GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTG

GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 

AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 

ACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAA 

ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 

GGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 

GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 

AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 

AGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGAC 

ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA 

CAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGAAGAAAACTGTTTTAAGCCAC 

AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT 

TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGG 

CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 

AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 

GGAGAAGGTCCAAGAAGGCTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTT 

ATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 

AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 

CATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 

CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 

GTCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 

AAATTGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGA 

AATAGTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACT 

TCAGGAACTTGTTACAGAATATTCATTAAAGAAAACAAGCTGATATGTGCCTGTTTCTGGAC 

AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAA 

ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 

AAAAATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTG 

GTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 136

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53974 
<subunit 1 of 1, 468 aa, 1 stop 
<MW: 54393, pI: 5.63, NX(S/T): 2

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYRLF 
PRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA
NNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLLNPE
RYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFY
RLISGLHASINVHLSARYLLQETWLEKKWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLI
ELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAH
KLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFH
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site.

amino acids 94-97 

Glycosaminoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class-V pyridoxal-phosphate

amino acids 216-222 



Interleukin-7 proteins 

amino acids 338-343 
FIGURE 137

GCTGGAAATATGGATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAAAGACCTT 
TAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAA 
GGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAA 
TGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACA 
ACATTACAGAATTTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGG 
CTTAAGAAGTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATT 
CTTNGAGCGCCCAGATTTTCAACTNTTTACTCGAAATAAAATTCAGGATGAGGNAAACAAAA 
TGTTACTTTTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 



TTTTTTTGCTG 
FIGURE 138

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA 

AGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTCATACTGGACAGAAAC 

CGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGG 

TCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCCACCAGAAG 

CTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGC 

GCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGG 

GGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACTGGGAAATTCATCTC 

ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 

ATGGTGAGCTAAGGAGAGGGTGGTGGCAGTGTCTCTGAAGGTCCATAAAAGAAAAAAGAGAA 

GTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAA 

AGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCAACTGGGAGCATGTTCTGAGGGT 

GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 

CGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATATGTGCCCGTGTGGATGCTTCATTCCA 

GCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCCAGAGGAAGGCTGAGTACTTGGTT 

CCCAGAAGGAGATACTGGGTGGGAAAAAGATGGGGCAAAGCGGTATGATGCCTGGCAAAGGG 

CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 

TTCAGACTCCTGGCCAGGTGTGGTGGCCCACACCTGTAATTCTAGCACTTTGGGAGGCCAAG 

GTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCAT 

CTCTACTAAAAAAAAAAAAATACAAAAATTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTC 

ATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTG 

AGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAA 

AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 

CTCATGCCTGTAATCCCAACATTTTGGGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAG 

GTTTGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 
FIGURE 139

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57039 
><subunit 1 of 1, 124 aa, 1 stop
><MW: 13352, pI: 5.99, NX(S/T): 1

MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDVYRCPVGGAHNAPCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS

Important features: 
Signal peptide: 

amino acids 1-22 

Cell attachment sequence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 

Integrins alpha chain proteins 

amino acids 67-81 
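
The NX(S/T) count and the N-glycosylation position listed for Figure 139 can be cross-checked with a short script. The following is a minimal Python sketch and is not part of the original listing. It assumes the PROSITE-style sequon pattern N-{P}-[ST]-{P}, and it assumes that the spaces in the printed sequence are extraction debris and can be removed. The names SEQ, SEQUON and find_sequons are illustrative only.

```python
import re

# Figure 139 amino-acid sequence, with spaces removed
# (assumption: the spaces are extraction debris, not part of the data).
SEQ = (
    "MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW"
    "DGPSGDRRGDVYRCPVGGAHNAPCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS"
)

# Assumed motif: N, any residue except P, S or T, any residue except P.
# The lookahead allows overlapping candidates to be reported.
SEQUON = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sequons(seq):
    """Return 1-based inclusive (start, end) spans of candidate sites."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(seq)]


if __name__ == "__main__":
    print(len(SEQ), find_sequons(SEQ))
```

Under these assumptions the scan reports a single candidate spanning residues 98-101, which agrees with the listed N-glycosylation site and with the NX(S/T) count of 1 in the figure header.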
FIGURE 140

CACAGTTCCCCACCATCACTCNTCCCATTCCTTCCAACTTTATTTTTAGCTTGCCATTGGGA 
GGGGGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 
TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 
CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 
ATTCCCAGGGCCACCAGAAGCTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGAC 
AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 
TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 
CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 

CAGATGGTGATGG 
FIGURE 141

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGAATTCAGGAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 
FIGURE 142

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57033
<subunit 1 of 1, 311 aa, 1 stop
<MW: 35076, pI: 5.04, NX(S/T): 2

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW
SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG
GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILV
VVPLFVWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain:

amino acids 230-255

N-glycosylation site.

amino acids 40-43 and 134-137 

Tissue factor proteins. 

amino acids 92-119 

Integrins alpha chain proteins 

amino acids 232-262 
FIGURE 143

TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 

CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 

TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 

TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 

TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 

CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 

GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 

GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 

ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGGCACATTGGGC 

TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 

CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 

GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 

GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 

ACATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT
FIGURE 144

CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 

GGCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA 

GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAG 

GAGGAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC 

GGAGAGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 

GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 

AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 

GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 

GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 

CAGGGGTCCCTCGGAGGCCTCCTGGGGATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGC 

GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 

ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 

CTGGTGAATGCAGCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 

GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 

AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 

GTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCT 

GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 

AGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 

CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 

CAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 

TTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 

TCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 

TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 

TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 

AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 

TGGTGTCCCCCATGGTCGCTGAGACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 

CCCACAAGGCGAGGGGAGTTACCCCTAAAACAAAGCTATTAAAGGGACAGAATACTTA 
FIGURE 145

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA34353
<subunit 1 of 1, 328 aa, 1 stop
<MW: 36238, pI: 9.90, NX(S/T): 3

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLC 
AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPL 
LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL
FVNVASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSE 
TVTWILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 

Important features: 
Signal peptide:
amino acids 1-23

Transmembrane domain:

amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotic-type carbonic anhydrases proteins

amino acids 222-270, 128-164 and 45-92 
FIGURE 146

GGCGCCTGGTTCTGCGCX3TACTGGCTGTAC 

GTTCGTGTCCCCGCCCCTCGCTCCTG<^GCTACTGCTCAGAAACGCTGGGGCGCCCACCCTGGCAGACTAACGAA 
GCAGCTCCCTTCCCACCCCAACTGCAGGTCTAA 

CGCAGAGGCGGAGGCTCGCGTATTCCTGCAGTCAGCACCCACGTCGCCCCCGG^ 

GCGAGCGGGGCTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGGGCGGCTGCAGAGGCCGGCCGTCCX3GTTTGGCT 
CACCTCTCCCAGGAAACTTCAGACTOGAGAGCC^ 

ATCCTGAGGTCATTCATTA^AAGTGTACCGCGCGGGAGTGGCTCAGAGTAACCACAGTGCTGTTCATGGCTAGA 
GCAATT C CAG C CATG GTGGTT C CCAATG C CACTT T ATTG GAGAAACTTTTGGAAAAATACATG G ATGAGG ATGGT 
GAGTGGTGGATAGC CAAACAACGAGGGAAAAGGGC CATCACAGACAATGAGATGCAGAGTATTTTGGACCTTCAT 
AATAAATTACGAAGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACZATGGGATGTAGAGCTGGAAAGA 
TCTGCAGAATCCTGGGCTGAAAGTTGCTTGTGGGAACATGGACCTGCAAGCTTGCTTCCA^ 
TTGGGAGCACACTGGGGAAGATATAGGCCCCCGACGTTTC7VTCTACAATCGTGGTATGATGAAG 
AGCTACCCATATGAACATGAATGCAACCCATATTGTCCATTCAGGTGTTCTG 
CAGGTCGTGTGGGCAACTAGTAACAGAATCGGTTGTGCCATTAAl^ 

ATATGGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCA 
CATGGGCGGCCCTGTTCTGCTTGCCCACCTAGTTTTGGAGGGGGCTGTAGAGAAAATCTGTGCTACAAAGAAGGG 
TCAG ACAG GTAT T ATC C C CCT CG AG AAGAGGAAACAAATGAAATAGAA CGACAGCAGTCACAAGT CCATGACACC 
CATGTCCGGACAAGATCAGATGATAGTAGC^GAAATGAAGTCATAAGCGCACAGCAAATGTC 
TGTGAAGTAAGATTAAGAGATCAGTGCAAAGGAACAACCTGCAATAGGTACGAATGTCCTGCTGGCTGTTTGGAT 

AGTAAAGCTAAAGTTATTGGCAGTGTACATTATGAAATGCAATCCM 

ATAATAGACAATCATGGTGGCTGGGTAGATATCACTAGACAAGGAAGAAAGCATTATTTC^^ 
AATGGTATTCAAACAATOGGCAAATATCAGTCIX^TAATT CCTT CAC AGTCTCTAAAGT AACAGTTCAGGCTGTG 
ACTTGTGAAACAACTGTGGAACAGCTCTGTCCATTTCATAAGCCTGCT 
CGTAACTGTATGCAAGGftAATCCAGATTATGCTCGTGTAATTGGAACTCGA 
TGCAGAGCAGCAGTACATGCTGGAGTGGTTCXSAAATCACXSGTGGTTATGTTGATGTA^ 

AAGAC CTACATTGCTTCTTTT CAGAATGG AAT C T T C T CAGAAAG TT T ACAGAATC CTCCAGGAGGAAAGG CATT C 
AGAGTGTTTGCTGTTGTGTGAAACTGAATACTTG 

ATTTTGTATAAAACTGTAACATTACTGTACAGAGTACATCAACTATTTTCAGCCCAAA 

TAAATCTTGATAAACAAAGTCTATAAAATAAAACATGGGACATTAGCTTTGGGAAAAGTAATGAAAATA 

TTTT AGAAATCCTGTGTTAAATATTG CTATATTTTCTTAGCAGTTATTTCTACAGTTAATTACATAGT CATGATT 

GTTCTACGTTTCATATATTATATGGTGCTTTGTATATGCCACTAATAAAATGAATCTAAACATTGAATGTGAATG 

GCCCTCAGAAAAT CATCTAGTGCATTTAAAAATAATCGACTCT AAAACTGAAAGAAACCTTAT CACAT TTTCC CC 

AGTTCAATGCTATGCCATTACCAACT C CAAATAATCTCAAATAATTTT CCACTTAATAACTGTAAAGT TTTTTTC 

TGT T AAT TT AGGGAT AT AGAATAT T AAAT TC TGAT ATTG CACT T C TT ATTTT ATAT AAAATAAT C CTT TAATAT C 

CAAATGAATCTGTTAAAATGTTTGATTCCTTGGGAATGGCCTTAAAAATAAATGTAATAAAGTCAGAGTGGTGGT 

ATGAAAACATTCCTAGTGATCATGTAGTAAATGTAGGGTTAAGCATCGACAGCCAGAGCTTTCTATGTACTC 

AAATTGAGGTCACATATTTTCTTTTGTATCCTGGCAAATACTCCTGCAGGCCAGGAAGTATAATAGC^ 

GAACAAAGATGAACTAATGTATTACATT ACCATTGC CACTGATTTTTTTTAAATGGTAAATGAC CTTGT ATATAA 

ATATTGCCATATCATGGTACCTATAATGGTGATATATTTGTTTCTATGAAAAATGTATTGTGCTTTGATACTAAA 

AATCTGTAAAATGTTAGTTTTGGTAATTTTT^TTCTGCTGGTGGATTTACATATTAAATTTTTTCTGCTGGTG^ 

TAAACATTAAAATTAAT CATGTTTCAAAAAAAAAAAAA 
FIGURE 147

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45417
<subunit 1 of 1, 500 aa, 1 stop
<MW: 56888, pI: 8.53, NX(S/T): 2

MKCTAREWLRVTTVLFMARAIPAMVVPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDNEM
QSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNLGAHW
GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQVVWATSNRIGCAINLC
HNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY
YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNR
YECPAGCLDSKAKVIGSVHYEMQSSICRAAIHYGIIDNDGGWVDITRQGRKHYFIKSNRNGI
QTIGKYQSANSFTVSKVTVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG
TRVYSDLSSICRAAVHAGVVRNHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRV
FAVV

Important features:
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7 protein
amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 

amino acids 28-31 
FIGURE 148

GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 
CAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 
GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCATGTCCCTGCTCCCACGCCG 
CGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 
ACACCGCGCGTGTGGACGGGTCCAAATGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 

AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT

CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 

AGAGCACCAAGCGCTTCATCAAGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAA 

GAATAGGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 

CTTTGCAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCCTTTC 

TTTCTCACAGGCATAAGACACAAATTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 

TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 

CCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAA 

AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 

GGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 

TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 

GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGTCTCT 

CTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 

GGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGACAAATGATC

CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 

TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 

TAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTGACAAGGGAA 

AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 

TTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 

GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 

CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 

TGCATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 

AAAAAAAAAAA 
FIGURE 149

MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPH 
CEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE 

Signal sequence: 

amino acids 1-34 
FIGURE 150

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTT 
GTTAACTGCTCTTGCCATGAAGTTCACCCT 

CAAAAT C CGGGGCCT AAGAACAC CGTT ACCCAATGAGATCTTGGGTCCAGTC 

TGCCTCACCCCCCACTGGAGAGAGGCGGTTTCAGCCCCCAGAACCCCCGTCXrTCCTGGACTGGCATCCGAAATAC 
TACTCAGTTTGCTGCTGTGTGCCCCCAGCACCTGGATGAGAGATCCTTACTGCATGACATGCTGCCCATC 
TACCGCCAATTTGGATACTTTGATGACCTATGTTCAAGATCAAAATGAAGACTGCCTTTACTTAAACATCTACGT 
GC CCACGGAAGATGGAGCCAACACAAAGAAAAACGCAGATGATATAACX3AG TAATGACCGTGGTGAAGACGAAGA 
TATTCATGATCAGAACAGTAAGAAGCCCGTCATGGTCT 

CATGATTGACGGCAGCATTTTGGCAAGCTACGGAAACGT<^TCGTGATCACCATTAAC 
AGGGTTTTTAAGTACCG^TGACCAGGCAGCAAAAGGCAACTATGGG 

GATTGAGGAGAATGTGGGAGC CTTTGGCGGGGACCCCAAGAGAGTGACCATCT TTGG CTCGGGGGCTGGGGCCTC 
CTGTGT CAGCCTGTTGACCCTGT CC CAC TACT CAGAAGGTCTCTTCCAGLAAGGG CATCATTCAGAGCGGCACCX»C 
CCTGTCCAGCTGGGCAGTGAACTACCAGCCGGCCAAGTACACTCGGATATTGGC^ 
GCTGGACACCACG<3ACATGGTAGAATGCCTGCGGAACAAGAACTACAAGGAG 

GGCCACCTACCACATAGCCT TCGGGCCGGTGATCGACGGCGACGTCATCCCAGACGACCCCCAGAT CCTGATGGA 
G CAAGGCGAGTTCCTCAACTACGACATCATGCTGGGCGTCAACCAAG GGGAAGGC CTGAAGT TCGTGGACGGCAT 
CGTGGATAACGAGGACGGTGTG^CGCCC^CGACTTTGACTTCTCCGTGTCCAACTTCGTGGACAACCTTTACGG 
CTACCCTGAAGGGAAAGACACTTTGCGGGAGACTATCAAGTTCATGTACACAGACTGGGCCGAT 
GGAGACGCGGCGGAAAACCCTGGTGGCTCTCTTTACTGACCACC^GTGGGTGGCCCCCGCCGTGGCCGCCGACCT 
GCACGCGCAGT ACGGCTC CCCCAC CTACTTCTATGCCTT CTATCAT CACTGCCAAAGCGAAATGAAGCC CAGCTG 
GG CAGATTCGGCCCATGGTGATGAGGTCCCCTATGTCTTCG 

TTGTAACTTTTCCAAGAACGACGTCATG CTCAGCGC CGTGGTCATGACCTAC TGGACGAACTTCGCCAAAACTGG 
TGATCCAAATC^^CCAGTTCCTCAGGATACCAAOTTCATTCACACAAAA 

GTCCAAGT AT AATC CCAAAGACCAGCTCTATCTGCATATTGGCTTGAAAC CCAGAGTGAGAGATCACTACCX3GGC 
AACGAAAGTGGCTTT CTGGTTGGAACTCGTTCCTCATT TGCACAACTTGAACGAGATATTCCAGTATGT TTCAAC 
AACCACAAAGGTTCCTCCACXIAGACATGACATCAT TTC CC TATGGCACCCGGCGATCTC CCGCCAAGATATGGCC 
AACC^CC^AACGCCCAGCAATCACTCCTGCG^CAATCCG^^ 
GGAGkCAACTGTCCTCATTGAAACCAAACGAGATTATTCC^ 
GCTCCTCTTCCTCAAGATCTTAGCTTTTGCGGCGCT 

GCGC CCCAGTCCCCAGAGAAACAC CACAAATGATATCGCTCACATCCAGAACGAAGAGAT CATGTCTCTGCAGAT 
GAAGCAGCTGGAACACGATCACGAGTGTGAGTCGCTG CAGGCACACGACACACTGAGGCTCACC TGCCCGCCAGA 
CT ACACCCT CACGC TG CG CCGGT CGC CAGATGACAT CC CACTT A TGA C G C CAAACACCAT CAC CATG ATTC CAAA 
CACACTGACGGGGATGCAGCCTTTGCACACTTTTAACACCTT^ 

CGGACATTCCACCACTAGAGTATAGCTTTGCCCTATTTCCCTTCCTATCCCTCTGCCCTACCCGCTC^GCAACAT 
AGAAGAGGGAAGGAAAG AGA GAAGGAAAGAGAGAGAGAAAGAAAG T CTC C AGAC CAGGAATGTTTT TGTC C CACT 
GACTTAAGACAAAAATGCAAAAAGGCAGTCATCCCATCCCGGCAGACCCTTATCGT^ 
AAGATCAACTTCTGACCCTGTGAAATGTGAGAAGTACACAITTCTGTTAAAATAACrGCTTTAAG 
CTCCIAATCJAATGTTTAGTGTGATAGGACATCACCATTTC 

GACA CTT CTG AAACT CAG C CAAGGACAC TTGAT ATTTT TT AAT TACAATGGAAG TTTAAACATT T CTT T CTG T GC 
CACACAATGGATGGCTCT C CTTAAGTGAAGAAAGAGT CAATGAGATTTTGCCCAGCACATGGAGCTGTAATCCAG 
AGAGAAGGAAACGTAGAAATTTATTATTAAAAGAATGGACTGTGCAGCGAAATCTC 
GTGTTTTGCCAGCCTGAACTATATTTAAGAGACTTTGT 
MLNSNVLLWLTALAIKFTLIDSQAQYPWNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASP

PTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQN 

EDCLYLNIYVPTEDGANTKKNADDITSNDRGEDEDIHDQNSKKPVMVYIHGGSYMEGTGNMI

DGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVGAFGGDPKR

VTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILArJKYGCNML

DTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGV

NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENP 

ETRRKTLVALFTDHQWVAPAVAADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 

FGIPMIGPTELFSCNFSKNDVMLSAVVMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVA

WSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTS

FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVG 

ASLLFLNILAFAALYYK^KRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 

SLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTN

LPHGHSTTRV 

Signal sequence: 
amino acids 1-24 

Transmembrane domains:

amino acids 189-204, 675-692 
GGGAAAGATCGCGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGT

CGGCTCGGGATGGGTCCAGGATGTTACTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCA 

CAGCAAGTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCC 

CTACCAGGGTGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGA 

TGACCCAGTATATCCGCCTTACCCCAGATATGCAAAGTAAACAGGGTGCCTTGTGGAACCGG 

GTGCCATGTTTCCTGAGAGACTGGGAGTTGCAGGTGCACTTCAAAATCCATGGACAAGGAAA 

GAAGAATCTGCATGGGGATGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGC 

CTGTGTTTGGAAACATGGACAAATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAAT 

GAGGAGAAGCAGCAAGAGCGGGTATTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCT 

CAGCTATGATCATGAGCGGGATGGGCGGCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCC 

GCAATCTTCATTACGACACCTTCCTGGTGATTCGCTACGTCAAGAGGCATTTGACGATAATG 

ATGGATATTGATGGCAAGCATGAGTGGAGGGACTGCATTGAAGTGCCCGGAGTCCGCCTGCC 

CCGCGGCTACTACTTCGGCACCTCCTCCATCACTGGGGATCTCTCAGATAATCATGATGTCA 

TTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCCCAGAAGAGGAAAAGCTCCATCGA 

GATGTGTTCTTGCCCTCAGTGGACAATATGAAGCTGCCTGAGATGACAGCTCCACTGCCGCC 

CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 

TAGTCATTGGTATCATACTCTACAACAAATGGCAGGAACAGAGCCGAAAGCGCTTCTACTGA 

GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 

GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 

TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCGCATGGTTGTGCATGGGGACATCTAACT 

CTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 

TTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 

AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 

AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 

TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 

TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 

GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 

TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 

GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 

AAGTTTGGCTAAAGGTTGGTGTAAAAATCAAGAGAAGCCTGGAAGACATCATGGATGCCATG 

GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTG 

GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGT 

TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 

GGGCTTGGATATTGCGCAGAGAAGAAATTTGGCTTTTTTTTTCTTAATGGACAAGAGACAGT 

TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 

CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 

TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCAG 

AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGCCTAAATGAAAT 

TCTTAAAATCACCGATGGAACCAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 

ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 
FIGURE 153

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50911
><subunit 1 of 1, 348 aa, 1 stop
><MW: 39711, pI: 8.70, NX(S/T): 1

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQ 
GVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQGKKN
LHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFPYISAMVNNGSLSY 
DHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRG 
YYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLS 
GLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY

Signal sequence: 

amino acids 1-38 

Transmembrane domain:

amino acids 310-329 
FIGURE 154

CCGAGCCGGG CGCGCAGCGACGGAGC TGGGGCCGGCCTGGGACCATGGGCGTGAGTGCAATCTACGGAT CAGTCT 
CTGATGGTGGGTCGTTAACCTCAGTGGGGACTCCAAGATTTCCATGAAGAAAAT CAGTTGTCTTCATT CAAGAAT 
TGGGGT CTGGCT CAGAATTCCTGCAGCTGGTCAAAATCTGTTTTCT^ CTGCAGTCT 
GACAT GTT C C CGATT T GAGGTGAAAC CATG AAG AG AAAAT AGAATACTTAATAATGC TTTT C C GCAAC C G C TT CT 
TGCTGCTGCrGGCCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGATCCCGGTGT 
CG ACTC C TAAGAATGGAAT GAGTAGCAAGAGTC G AAAGAGAATCATG CC CGAC C C TG T GACGGAGCC C CCTGTGA 

CGCATCATTT TAAG CTGGTCT CAGTGCATGTGTTCATTCGCCACGGAGACAGGTACCCACTGTATC 
AAACAAAGCGACCAGAAATTGACTGCACTCTGGTGGCTAACAGGAAACCGTATCACCCAAAAC 
TTAGTCACATGTCAAAAGGATCCGGAGCCTCTTTCGAAAGCCCCTTGAACTCCTTGCCTCTTTACXICAAATCACC 
CATTGTGTGAGATGGGAGAGCTCACACAGACAGGAGTTGTGGAGC^TTTGCAGAACGGTCA 

T CTATCTAAAGAAACACAAACTCCTGCCCAATCATTGGTCTGCAGACCAGCTCTATTTAGAGAC CACTGGG AAAA 
GCCGGACCCTACAAAGTGGGCTGGCCTTGCTTTATGGCTTTCTCCCAGATTTTGACTGGAAGAAGA^ 
GGCACCAGCCAAGTGCGCTGTTCTGCTCTGGAAGCTGCTATTGCXrCGGTAAGAT^CCAGTATCT 
AGCGTCXSTCAGTACCTCCTACGTTTGAAAAAC^GCCAGCTGGAGAAGAC^ 

ATGTCCC CA CCAAGCAGCTTAGAGCTGC CAACCCGATAGACT CC ATGCTCTGCCACTTCTG CCACAATGTCAGCT 
TTCCCTGTACCAGAAATGGCTGTGTTGACATGGAGCACTTCAAGGTAATTAAGACCCATCAGATCG 
GGGAAAGACGGGAGAAGAAATTGTACTT CGGGTATT CTCTCCTGGGTGCC CACCCCATCCTGAACCAAACCAT CG 
GCCGGATGCAGCGTGCCACCGAGGGCAGGAAAGAAGAGCTCTTTGCCCTCTACTCTGCTCATGATO 
CACCAGTTCTCAGTGCCTTGGGCCTTTGAGAAGCCAGGTTCCCAAGGTTTGCAGCCAGGTTGATCT 
GGCAAGACAGAGAAAAGCCCAGTGAACATT C CGTC CGGATT CTTTACAATGGCGTCGATGT CACATTC CACACCT 
CTTTCTGCCAAGAC CAC CACAAGCGTTCTCCCAAGCC CATGTGC CCGCTTGAAAACT TGGT CCGCTTTGTGAAAA 
GGGACATGTTTGTAGCCCTGGGTGGCAGTGGTACAAATTATTATGATGCATGTCA 
T ATGGAGTAC AGGAGTATAGAAT CCATGCCAATACAGAGCATAGG 

TAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCACAGATGGTTGGGGTTGAACAGT 

AAGCACATTGCTGCAATGTGGTACGTGAATTGCTTGGTACAAAATGGCCAGTTCAGAGAGGAATAGAAGGTAC^ 

TATCATAGCCAGACTTCGCTTAGAATGCCAGAATAATATAGTTCAAGACCTG^ 

TCTTCTGGCCTGCCCCATGTTACTATGTGATGGAACCAGC^CACCTCAACCAAAATTTTTTTAATC 

TTTACCT TGTC CTTGTTAAGAATTT CTTGAAGTGATTTATCT AAAATAAAGGTTGGCAAACTTTTTCTGTAAAGG 

GCCAGATTGTAAATATTTCAGACTGTGTGGACCAAAAGGCCACATACAGTCTCTGTCATAACT 

TTCTGAAGCAGGAAAGCCACCACAGACAGTACATAAAGGAA 

GATGGTGACCAGACTTGGCCC CTGGG CTGTAGTTTGCTGACCCCTCAT CTAAAAAATAGGCTATACTACAATTG C 
ACTTCCAGCACTTTGAGAACGAGTTGAATACCAAGAATTATTCAATGGTTCCTCCAGTAA 

CAGAATTTGGTCTGTAT CTGACACTAGAACAAAACTTGAGGGTAAATAAACATTGAATTAGAATGAAT CAT AGAA 
AACTGATTAGAAGAATACTTGATGTTTATGATGATTGTGGTACAAGATAGTTTTAAGTATGTTCTAAATATTTGT 
CTGCTGTAGTCTATTTGCTGTATATGCTGAAATTTTTC 

TTCTAAGACCAGTTTTAGATGACTC^TATTCCTGTAGTAATATTCAATTTGCTGTO 

GAGGCTAGAAGATGAATTCAGGCACTTT CTTCCAATAAAACTAATTATGGCTCATTC CCTTTGACAAGCTGTAGA 
ACTGGATTCATTTTTAAACCATTTTCATGAGTTTGAAATGGTAAAT 

AGAACTTTGCTATTAGGTAGTTTACAGATCTTTATAAGGTGT^TTATATATTAGAAGCAATTATAAT 
TGATTTCTGAACTAAT GGTG CT AAT T CAGAG AAAT GGAAAGTGAAAGTGAGATTCT CTGTTGTC ATCGGCATT C C 
AACTTTTTCTCTTTGTTTTTGTCCAGTGTTGCATTTGAATATGTCICTTTCTATAAATAAATTTTT^ 
FIGURE 155

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48329
><subunit 1 of 1, 480 aa, 1 stop
><MW: 55240, pI: 9.30, NX(S/T): 2

MLFRNRFLLLLALAALLAFVSLSLQFFHLIPVSTPKNGMSSKSRKRIMPDPVTEPPVTDPVY 
EALLYCNIPSVAERSMEGHAPHHFKLVSVHVFIRHGDRYPLYVIPra?KRPEIDCTLVANRKP 
YHPKLEAFISHMSKGSGASFESPLNSLPLYPNHPLCEMGELTQTGVVQHLQNGQLLRDIYLK
KHKLLPNDWSADQLYLETTGKSRTLQSGLALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCP 
VRNQYLEKEQRRQYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHNVSFPC 
TRNGCVDMEHFKVIKTHQIEDERERREKKLYFGYSLLGAHPILNQTIGRMQRATEGRKEELF 
ALYSAHDVTLSPVLSALGLSEARFPRFAARLIFELWQDREKPSEHSVRILYNGVDVTFHTSF 
CQDHHKRSPKPMCPLENLVRFVKRDMFVALGGSGTNYYDACHREGF 



Signal sequence: 
amino acids 1-18 
AAAAAAGCTCACTAAAGTTTCTATTAGAGCGA^

GCTATTTAAGAGATAAAAACGAAATAT CCTTTCTGGGAGTTCAAGATTGTGCAGTAATTGGTTAGGACTC TGAGC 

GCCGCTGTTCACCAATCGGGGAGAGAAAAGCGGAGATCCTGCTCGCCTTGCACGCGCCTGAAGCACAAAGC^ 

AGCTAGGAATGAACGATCCCTGGGAGTATGTGGAAACAACGGAGGAG 

GGGCGAAGGAACTGCTCCTGACTTCAGTGGTTAAGGGCAGAATTGAAAATAATTCTGGAGGAAGATAAGAA 

TCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGACTCTGTGGGAGACCGG 

ATGCACCCAGATACGCTATTCAGTTCCGOAAGAGCTGGAGAAAGGCTCTAGGGTGGGCGACATCTCCAGGGACCT 

GGGGCTGGAGCCCCGGGAGCTCGCGGAGCGCGGAGTCCGCATCATCCCCAGAGGTAGGACGCAGCT^ 

GAATCCGCGCAGCGGCAGCTTGGTCACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCC^TCAAGTG 

T CAATT AAAT C TAG ACATT C TGAT GGAGGAT AAAG TGAAAATA TATGG AGTAG AAGTAGAAGTAAGGGACATT AA 

CGACAATGCGCCTTACTTTCGTGAAAGTGAATTAGAAATAAAAATTAGTGAAAATGCAGCCACTGAGATGCGGTT 

CCCTOTACCCCACGCCTGGGATCCGGATATCGGGAAGAACT^ 

CTTCTC CCT CATCGTGCAAAATGGAGCCGACGGTAGTAAGTAC CCCGAATTGGTGCTGAAACGCGCC CTGGACCG 
CGAAGAAAAGGCTGCTCACCACCTGGTCCTTACGGCCTCCGACGGGGGCGACCCX5GTGCGCACAGGCACCGCGCG 
GATCCXSCGTCATGGTTCTGGATGCGAACGACAACGC^ 

GGAGAATCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACX3CTACCGACCCTGACGAAGGAGTCAATGCGGAAGT 

GAGGTATTCCTTCCGGTATGTGGACGACAAGGCGGCCCAAGTTTTOVAACTAGATTGTAATTCAGGGAC^ 

AACAATAGGGGAGTTGGACCACGAGGAGTCAGGATTCTACCAGATGGAAGTGC7UVGCAATGGATAATGCAGGATA 

TTCTGCGCGAGCCAAAGTCCTGATCACTGTTCTGGACGTGAACGACAATGCCCCAGAAGTGGTCXrrCACCTCTCT 

CGCCAGCTCGGTTCCCGAAAACTCTCCCAGAGGGAC^TTAATTGCCCTTT^ 

GGAAAACGGACAGGTGATCTGTTT CATC CAAGGAAAT CTGC CCTT TAAATTAGAAAAAT CTTACGGAAATTACTA 
TAGTTTAGTCACAGACATAGTCTTGGATAGGGAAC^GGTTCCTAGCTACAACATC^ 

GGGAACCCCGCCCCTATCCACGGAAACTCATATCTCGCTGAACGTGGCAGACACCAACGACAACCCGCCGGTCTT 

CCCTCAGGCCTCCTATTCCGCTTATATCCCAGAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGACCGCCCA 

CGACCCCGACTGTGAAGAGAACGC CCAGATCACTTATTCC CTGGCTGAGAACAC CAT CCAAGGGGCAAGCCTATC 

GT CCTACGTGTCCATCAACT CCGACACTGGGGTAC TGT ATGCGCTGAGCT C CTT CGACTACGAGCAGTT CCGAGA 

CTTGCAAGTGAAAGTGATGGCGCGGGACAACGGGCACCCGCCCCTCAGCAGCAACGTG 

GCTGGACCAGAACGACAATGOTCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGT 

GGCTCCCCX3CTCCGCAGAGCCCGGCTACCTGGTGAC(^U^GGTGGTGGCGGTGGACAGAGACTCCX3GCC^GAACGC 

C TGGCTGTCCTAC CGT CTGC T CAAGGCCAGCGAGCCGGGACT CTTCTCGGTGGGTCTGCACACGGGCGAGGTGCG 

CACGGCGCGAGCCCTGCTGGACAGAGACGCGCTCAAGCAGAGCCTC^ 

CCCTCTCTCCGCCACTGTCAOSCTCACCGTGGCCGTGGCCGA^^ 

CCTCGAGTCTCCAGCTAACTCTGAAACCTC^GACCTC^CTCTGTACCTGGTGGTAGCGGTGGCCGCGGTOTCCTO 
CGTCTTCCTGGCCTTCGTCATCTTGCTGCTGGCGCTCAG^ 

TTCAGGAGGCGGCTTGACAGGAGCGCCGGCGTCGCACTTTGTGGGCGTGGACGGGGTGCAGGCTTTCCTGCAGAC 
CTATTCCCACGAGGTTTCCCTCACC^CGGACTCGCGGAAGAGTC^CCI^ 
CATGCTCGTCAGCGAGGAGAGCTTTGAAAAAAGCGAGCCC CTTT TGCTGTCAGGTC 

CAGTCATGGGTTAATTGAGGTGAGTTTATATCAAATCTTCTT^CTTTTTTTTTTTAATTGCTCTGTCTCCCAAGC 
TGGAGTGCAGCG GTACGAT CATAGCTCACTGCGGCCTCAAACT C CTAGGCT CAAGCAATTATCCCAC CTTTGCCT 
CCGGTGTAACAGGGACTACAGGTGCAAGCCACCTACTGTCTGCCTATCTATCTATCTATCTATCTATCTATCTAT 

CGAGGCGGGTGGATGACCTGAGGTTGGGAGTTTGAGACCA<3CCTGAC 

AAAAATACAAAATTAGCCGGGCGTGGTGGTGGATGTCTGTAATCCCAGCTACTTGGGAGGCT 

TGCTTT AAC CTGGGAGGTGGAGGTTGCAATGAG CTGAGATTGTGCCATTGCAC TCCAGCCTGGGCAACAAGAGT G 
AAACTCTATCTCA 
></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48306
><subunit 1 of 1, 916 aa, 1 stop
><MW: 100204, pI: 4.92, NX(S/T): 4

MIPARLHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPRELAER
GVRIIPRGRTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDILMEDKVKIYGVEVEVR
DINDNAPYFRESELEIKISENAATEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGA
DGSKYPELVLKRALDREEKAAHHLVLTASDGGDPVRTGTARIRVMVLDANDNAPAFAQPEYR
ASVPENLALGTQLLVVNATDPDEGVNAEVRYSFRYVDDKAAQVFKLDCNSGTISTIGELDHE
ESGFYQMEVQAMDNAGYSARAKVLITVLDVNDNAPEVVLTSLASSVPENSPRGTLIALLNVN
DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLST
ETHISLNVADTNDNPPVFPQASYSAYIPENNPRGVSLVSVTAHDPDCEENAQITYSLAENTI
QGASLSSYVSINSDTGVLYALSSFDYEQFRDLQVKVMARDNGHPPLSSNVSLSLFVLDQNDN
APEILYPALPTDGSTGVELAPRSAEPGYLVTKVVAVDRDSGQNAWLSYRLLKASEPGLFSVG
LHTGEVRTARALLDRDALKQSLVVAVQDHGQPPLSATVTLTVAVADSIPQVLADLGSLESPA
NSETSDLTLYLVVAVAAVSCVFLAFVILLLALRLRRWHKSRLLQASGGGLTGAPASHFVGVD
GVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL
IEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS
IYLSIYLSIYLSIYLLLSCTDGSLTPVIPVLWEAEAGGSPEVGSLRPA

Signal sequence: 

amino acids 1-30 

Transmembrane domains:

amino acids 693-711, 809-823, 869-888 
FIGURE 158

CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAG 
GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 
TCAGTAGGTGACCCCGCCCCTGGATTCTGGAAGACCTCACCATGGGACGCCCCCGACCTCGT 
GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 
ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 
CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 
ACAGCTGCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
TAAAGATGGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACA 
GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
ATAGGCAGCAAGGGCTGATTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCT 

CTGGTTC 
FIGURE 159

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48336
<subunit 1 of 1, 260 aa, 1 stop
<MW: 28048, pI: 7.87, NX(S/T): 1

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVL 
VGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPVVQSIPHPCYNSSDVEDHNHDLMLL
QLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCED 
AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 
amino acids 1-23 

Transmembrane domain:

amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site, 
amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site, 
amino acids 182-188 

Kringle domain proteins motif 

amino acids 205-217 
FIGURE 160

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 

SScSg^Sgccccaaccccggcccgcgccccctagcccccgcccgggccotcgcccgc 

GCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCG 
CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAAC 
ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 
CCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGC 
GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 
TGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 
CCCGCCCAACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 
CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 
GCCCAAGGTGACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTG 
CTGCCAGCCAGCGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTC 
CGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGC 
GGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCC 
GGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGC 
TGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTA 

CCTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCA

CACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTG 
CACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTT 

cJotga^ttcgccgcctcatgacactctatctgtttgcc^caatctatcagcgc^cca 
Sgaggccctggcccccctgcgtgccctgcagtacctgaggctcaacgacaacccctcggto 

TGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGA 
^GTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATG 

GATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGT 

ACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTG 
CCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTT 
CCCCACCTCGGGCCCTCGCCGGAGGCCACMCTGTTCACGCAAC^CCGCACCCG^ 
G^CGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGGA^ 

CTACCCAGCCTCACCTGCAGCCTCACCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCT 
TGGGCCCTGCTGACCCCCAGCGGACACAAGAGCGTGCTCAGCAGCCAGGTGTGTGTACATAC 
GGGGTCTCTCTCCACGCCGCCAAGCCAGCCGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAG 

Stcctccctgatggacgcctgccgcccgccacccccatctccaccccatcatgtttacaggg 

ScGG?G^GCGTTTGTTCCAGAACGCCGCCTCCCACCCAGATCGCC^TATATAGAGATAT 

GCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAA

AAAA 
FIGURE 161

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44184 
><subunit 1 of 1, 473 aa, 1 stop
><MW: 50708, pI: 9.28, NX(S/T): 6

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRI
FLHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPA
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH
GNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAP

LRALQYLRLNDNPVrtfCDCR^^ 

VATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 

GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC

Important features: 
Signal peptide:

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156

Glycosaminoglycan attachment site. 

amino acids 436-439 

N-glycosylation site.

amino acids 82-85, 179-182, 237-240, 372-375 and 423-426



VWFC domain 

amino acids 411-425 
FIGURE 162

GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 
TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 
AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAATGGAGGCCAGCGGGA 
AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 
GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 
CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG
TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 
GAGAAATTGGACCGTGAGGATCTGTGCGGTCACACAGAGCCCTGTGTGCTACGTTTCCAAGT 
GTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACC 
ACTCTCCAGTATTTCTGGACAAACAAATGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGG 
ACTACGTTTCCTCTGAAGAATGCCGAAGACTTAGATGTAGGCCAAAACAATATTGAGAACTA 
TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 
ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 
CTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGT 
CCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 
AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC
AACGGAGAGATTTCCTATTCACTTTTCCAAGCTTCAGAAGAGATTGGCAAAACCTTTAAGAT 
CAATCCCTTGACAGGAGAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCT 
ATGAAGTCAATATTGAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTGATT
CAAGTGATAGATGTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAAT 
ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 
AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 
AACTTTTACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACAT 
CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 
TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 
CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 
CACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACAT 
CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 
GCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCAG 
CGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACC 
CGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTAC 
CTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCCTGGCTGTCGTACCA 
GCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCA 
CCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTGGTGGTGCTGGTCAAGGAC 
AATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTGCTCCTGGTGGACGGCTTCTC 
CCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAGGCCCAGGCCGACTTGCTCACCG 
TCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCTCTTTTCGGTGCTCCTGTTC 
GTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTCGCTGCTTGGTGCCCGA 
GGGCCCCCTTCCAGGGCATCTTGTGGACATGAGCGGCACCAGGACCCTATCCCAGAGCTACC 
AGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCTGAAGCCGATT
ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 
TAACTTTGGGTTCAATATTCAGTGACCATAGTTGACTTTTACATTCCATAGGTATTTTATTT
TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 
TTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATT 
CCTGGTTCTT 
FIGURE 163

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48314
<subunit 1 of 1, 798 aa, 1 stop
<MW: 87552, pI: 4.84, NX(S/T): 5
MEASGKLICRQRQVLFSFLLLGLSLAGAAEPRSYSVVEETEGSSFVTNLAKDLGLEQREFSR
RGVRVVSRGNKLHLQLNQETADLLLNEKLDREDLCGHTEPCVLRFQVLLESPFEFFQAELQV
IDINDHSPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKR
SDGRKYPELVLDKALDREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY
RVQISEDSPVGFLVVKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTGEIELKKQLDF
EKLQSYEVNIEARDAGTFSGKCTVLIQVIDVNDHAPEVTMSAFTSPIPENAPETVVALFSVS
DLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQ
LNMTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP
HLPLTSLVSINADNGHLFALRSLDYEALQGFQFRVGASDHGSPALSSEALVRVVVLDANDNS
PFVLYPLQNGSAPCTELVPRAAEPGYLVTKVVAVDGDSGQNAWLSYQLLKATELGLFGVWAH
NGEVRTARLLSERDAAKHRLVVLVKDNGEPPRSATATLHVLLVDGFSQPYLPLPEAAPTQAQ
ADLLTVYLVVALASVSSLFLFSVLLFVAVRLCRRSRAASVGRCLVPEGPLPGHLVDMSGTRT
LSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPGKEIQGNSTFPNNFGFNIQ

Important features:
Signal peptide: 
amino acids 1-26 

Transmembrane domain: 

amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP-binding site motif A (P-loop).

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 
FIGURE 164

ACCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGC 
GCCGATTGCCTCTCGGCCTGGGCAATGGTCCCGGCTGCCGGTCGACGACCGCCCCGCGTCAT 
GCGGCTCCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGG 
AGGTTGCAGAGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTG 
GGGGCTGTGTACCTGGGTGAGGAGGAGCTCCTGCATGACCCX5ATGGGCCAGGACAGGGCAGC 
AGAAGAGGCCAATGCGGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTG 
TGATTCCTGGGGAAGCTGAGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCT 
GGAGGAGCGGAGGACTCAAGGTGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGCTGG 
AGCACACTTCCCTGACAGAGAAGAGGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACG 
CAGCCCCGACAGAGGACTCCAATAACACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG
GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT
TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 
GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 
CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 
TGTTCCTAATATTTTATTATTTCAAGGAGCTAAACCAATGGCCAGATTTAATCATACAGATC 
GAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTATAGAAGCCAAGAAGAAT 
GTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTGATAAAAAGTGTGGA
CTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGCTACCATTCGAA 
CTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGAGTAGTGATGGTCT 
GAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTTTCATA 
CATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATTTGTTGAA
CAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGC 

AAAAATATTCAATAG 
FIGURE 165

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48333
><subunit 1 of 1, 360 aa, 1 stop
><MW: 39885, pI: 4.79, NX(S/T): 7

MVPAAGRRPPRVMRLLGWWQVLLWVLGLPVRGVEVAEESGRLWSEEQPAHPLQVGAVYLGEE
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRC
NVRESLFSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLE
NFTLKILNMSQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFLALDAS
QHSSLSTRFGTVAVPNILLFQGAKPMARFNHTDRTLETLKIFIFNQTGIEAKKNVVVTQADQ
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain:

amino acids 321-340 

Homologous region to disulfide isomerase

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 



Thioredoxin domain 

amino acids 211-227 
FIGURE 166

CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCCCTGGCT
CGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 
TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGCA 
GCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 
TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 
GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 
CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 
CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 
TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 
TGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGTAAAAATG
ATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATC 
CTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCTGAAGAA
ATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACG 
CGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG 
TGGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTA 
GTCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC 
GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT 
TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 
GATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGAATCTTGTAGAAATATTCAAACTAATA
AAATCATGAATATTTTAA 
FIGURE 167

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50920
><subunit 1 of 1, 295 aa, 1 stop
><MW: 33518, pI: 7.74, NX(S/T): 0

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPN
LLGHETMKEVLEQAGAWIPLVMKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSLCVQVKDR
CAPVMSAFGFPWPDMLECDRFPQDNDLCIPLASSDHLLPATEEAPKVCEACKNKNDDDNDIM
ETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSLQCTCE
EMNDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC

Important features: 
Signal peptide: 

amino acids 1-20 

Cysteine-rich domain, homologous to frizzled N-terminus

amino acids 6-153 
FIGURE 168

GTGGAGGCCGCCGACGATGGCGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGG 
CCCTGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATC 
AGCCTGCTCAACTGCTCCAACGCCACGCTGTGGCTCAGCTTTGCACCTGTGGCTGACGTCAT 
TGCTGAGGACTTGGTCCTGTCCATGGAGCAGATCAACTGGCTGTCACTGGTCTACCTCGTGG 
TATCCACCCCATTTGGCGTGGCGGCCATCTGGATCCTGGACTCCGTCGGGCTCCGTGCGGCG 
ACCATCCTGGGTGCGTGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGT 
TGTTGGGACCCAAAACCCATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCC 
AGAGCCTGGTCATCTTCTCTCCAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCAGCGA 
GCCACGGCCAACATGCTCGCCACCATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCT 
GTCCCCTGTGCTGGTCAAGAAGGGTGAGGACATTCCGTTAATGCTCGGTGTCTATACCATCC 
CTGCTGGCGTCGTCTGCCTGCTGTCCACCATCTGCCTGTGGGAGAGTGTGCCCCCCACCCCG 
CCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAGAAGTTCCTGGATGGGCTCAAGCTGCAGCT 
CATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTGGTTGGGGGGAATGATCGGGATCTCTG 
CCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAAGCGGCCACTCCAGTGGGTTTTCC 
GGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGGGCACTGGCTCTCGGCCCCTA 
TGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCTGTGCCTGTTCTCTCTGG 
CCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTGCCCTGGCTGCCACC 
TGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAGTTGGCGGTCGA 
GTGTTCCTTCCCCGTGGGGGAGGGGGCTGCCACAGGCATGATCTTTGTGCTGGGGCAGGCCG 
AGGGAATACTCATCATGCTGGCAATGACGGCACTGACTGTGCGACGCTCGGAGCCGTCCTTG 
TCCACCTGCCAGCAGGGGGAGGATCCACTTGACTGGACAGTGTCTCTGCTGCTGATGGCCGG 
CCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGC 
AGGCCGAGTCTGGGGAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCG 
GGTGTGGACCGAGGGGGAGCAGGAAGGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGA 
GTGCACGGCGAGGGGGGCCTCGCTAGAGGACCCCAGAGGGCCCGGGAGCCCCCACCCAGCCT 
GCCACCGAGCGACTCCCCGTGCGCAAGGCCCAGCAGCCACCGACGCGCCCTCCCGCCCCGGC 
AGACTCGCAGGCAGGGTCCAAGCGTCCAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTT 
CTCCTCCCCGTGGGTGATCACGTAGCTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGA
TGGAGGCGAACTGGAACATCTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCT 
CCGGGAGCGAATTACAAGCGCGCACCTGAAAA 
FIGURE 169

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50988
><subunit 1 of 1, 560 aa, 1 stop
><MW: 58427, pI: 6.86, NX(S/T): 2

MAGPTEAETGLAEPRALCAQRGHRTYARRWVFLLAISLLNCSNATLWLSFAPVADVIAEDLV
LSMEQINWLSLVYLVVSTPFGVAAIWILDSVGLRAATILGAWLNFAGSVLRMVPCMVVGTQN
PFAFLMGGQSLCALAQSLVIFSPAKLAALWFPEHQRATANMLATMSNPLGVLVANVLSPVLV
KKGEDIPLMLGVYTIPAGVVCLLSTICLWESVPPTPPSAGAASSTSEKFLDGLKLQLMWNKA
YVILAVCLGGMIGISASFSALLEQILCASGHSSGFSGLCGALFITFGILGALALGPYVDRTK
HFTEATKIGLCLFSLACVPFALVSQLQGQTLALAATCSLLGLFGFSVGPVAMELAVECSFPV
GEGAATGMIFVLGQAEGILIMLAMTALTVRRSEPSLSTCQQGEDPLDWTVSLLLMAGLCTFF
SCILAVFFHTPYRRLQAESGEPPSTRNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARG
ASLEDPRGPGSPHPACHRATPRAQGPAATDAPSRPGRLAGRVQASRFIDPAGSHSSFSSPWVIT

Important features:
Signal peptide:
amino acids 1-44

Transmembrane domains:

amino acids 61-79, 98-112, 126-146, 169-182, 201-215, 248-268,
280-300, 318-337, 341-357, 375-387, 420-441

N-glycosylation site.

amino acids 40-43 and 43-46 

Glycosaminoglycan attachment site. 

amino acids 468-471 
FIGURE 170

GTCCCACATCCTGCTCAACTGC^TCAGGTCCCTCTTAGACCAGCTCTTGTCCATCATTTGCT^GT^CC^C 

TRGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGG 

^CCTGCCOTGGGGTCCTGCTTGTTTCATJ^TCATCTAAC^^ 

AAGGAGcScGGGGCTGATCAAGC^ 

T^TGAATCTAGCCCACTTGGCGXaTAAGC 

CTAOTATCTCTTTTAG^ 

J^CTtMTACAGTGATCGGGAAGC 

ccScttcSotgc^ 

GOTGGATCGAGAGCAGCTGTGCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCT 

Sctga^cca^g^S 
otggaaa"™tgagagcg^ 

taaca^tc^cLctacactctgtctcccagtgagcactttgccttggatgtcattgtgggccctgatgagac 
caaa^tgS^ct^ 

^^ca^tgggaaccccccc^gtcaggtaccagcttggt 

cI^g^tgc^ga^c^ 

gaccg^cIcXgacc^c^gg^^ 

g^otacaot^gtattgatgccaagacaggccaggtcattctgcgtcgacctctagactatgaaaagaaccc 

^ctacgaggtg^^ 
g^tcStctcaatgacaacatcccaagcat^ 
tcttcc^ggacagttttattgctctt^ 
otggctgagccaagagctgggccacttcaggctgaaaagaac 

cacactxsgacagagagcagtxmccc 
^caagaaa^ctcag^ctcagatcagtgacatcaa 

AGTC^CAC^OTGGAAAACAAC^CCCrCTCTTCACCT 

TAATGGAAAAGTC^TC^TACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGACTC ■ 

gotcac^Sgaggtcactgaactatgaagagatggccggctttga^ 

G^C^TG^TCCAT^ 

cca^cctc^cagcgatggaaaagcctvgcctctccgtgcttgtgaatgcctccacag^ 

rATCGAGA^^^C^CAAT^GCTTGGGCCCAGCGGGCACTGACACACCTCC^CTGGCCACTCACAGCTCCCGGCCATT 

AAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCftGCTGTTCGTCAATGTCACCA 

CATTGGGAGTGAGTGGGAGCTGGAGATAGTAGTAGAGGACCAGGGAAGCCCCCCCTTACAG 

GAGGGTCATGTTTGTCACCAGTGTGGACCACCTGAGGGACTCAGCCCGGAAGCCTGGGGCCTTGAGCATGTC 

GC^OGGTGATCTC 

ga^gaaaagaSc^^ 

CCA^AAACACATTCAGAAGGCAGACATCCACCTCGTGCCTGTGCTCAGGGGTC^ 
CGGGCAG^C^ACAAAGATGTGGACAAGGAGGCGATGATGGAAGCAGGCTGGX^CCCCTGC 

CCA^CTcIc^GA^cS 

rr^CCAM^CACGGTCAACCTCCTTTTCAACCA^ 

CGAG^CAG^CTGC^ 

TOTAG^C«G^^TCMGAAG^CCCACAGAGGCCACCAGC 

CAATG^CAAAGTGTCCCCTGAGAAAGAATCAGXjGCCCCGTCAGATCCTGCGGAGCCTGGTCCGGCT 

TGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTC^ 

G^TGC^G^CA^CCA^ 

cIgcaggagtg^^g^^ 

AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCT 
r^ArAMTCTGGCCCTGGACCraGCTGAGCGCCCCTGACCOSGCCTGGATGGp^ 

Sc^aa^ccgtoacaa^™tccccggatgctgcagccacgg 

CAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTCTCGGAGATGAGCTCA 
^GGAGATG^CTGG^ 

c^cg^gga^cc?ca™agacttggccaccagtccagcctcag^ 
aaaga^gIa^a^c^^ 

ctggatccaagaaccaggggcctgaggatctgtggacaagagctggtttctaaaatcttgtaactcact^ 
cggcggcctgagaactttagggtgactgatgctacccccacagaggaggcaagagccccaggact 

tgaccaaag^gccc^ 
tcctScaaaacatatgtggagcacaaagggt 

caataaaggaaaagcagtaaaaaaaaaaaaaaaaaaaa 
FIGURE 171

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48331
<subunit 1 of 1, 1184 aa, 1 stop
<MW: 129022, pI: 5.20, NX(S/T): 5

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA
GAAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQ 
VLDINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 
GPDETKHAELIWKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVXTVLDSNDNSPAFAESS
LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPL 
DYEKNPAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSF
IALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 
QGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 
YRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA

NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT
IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVTSTVTNASSLIGSEWELEIVVED
PPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEK

KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEA 
GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 
QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 
ILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS
RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPD 
PAWMARLSLPLTTNYRDNVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSL 
LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCD 

Important features:
Signal peptide:
amino acids 1-13
Transmembrane domain:
amino acids 719-739
N-glycosylation site.

amino acids 415-418, 582-585, 659-662, 662-665 and 857-860
Cadherins extracellular repeated domain signature.

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563
FIGURE 172

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 

CAGACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 

GCAGGAGCCTTCCTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCT 

CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 

GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 

GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 

GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 

TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 

ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 

AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC

ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 

CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 

TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 

GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 

AAGTGAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 

TTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 

AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 

CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 

TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT

TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 

CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 

AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 

TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 

TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 

AGGCACCAGAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGT 

GGTTTCAAAATTTAGATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC

AAACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 

CTATGACCATGAGTAGCATCAGCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA

GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA

AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 

AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 

AAAATC AGAGACTGTAAC AAAAAAAAAAAAAAAAAAAAAGGGCGGC CGCGACTCTAGAGT CG 

ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
FIGURE 173

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYVVQVIFSVTFAFSCTMFELIIFEI
LGVLNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFF
WKLGDPFPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDI
LALERRLLQTMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQ
QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVF
DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISS
SKSSNVIVLLLAQIMGMYFVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSA
LSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites.

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins

amino acids 151-160 
FIGURE 174

CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT
TTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG
FIGURE 175

GTGTTGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC 
TTTNTTGAATTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 
TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 
NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 
TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 
CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA
TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 

TAGGAGATCCCTTTCCCATTCTC 
FIGURE 176

CTCGCGCAGTCATCGTCCCATGGCCG^ 

tggctccctgctcgtcgaactgctcttctcacgggctgtcgccttcaatctggacgtgatgggtgccttgcgcaa 

ggagggcgagccaggcagcctcttcggcttctctgtggccctgc^ccggcagttgcagccccgaccccagagctg 

gctgctggtgggtgctccccaggccctggctcttcctc 

cccgttgagcctggaggagactgactgctacagagtggacatcx5accagg 

ggagaacc^gtggttgggagtcagtgttcggagcc^^ 

tgaggcaaggcagcgagtggaccagatcctggagacgcgggatatgattggtcgctgctttgtgctcagccagga 

cctggccatccgggatgagttggatggtggggaatggaagttctgtgagggacgcccccaaggccatgaacaatt 

tgggttctgccag<^gggcacagctgccgccttc^^ 

ctataattggaagggcacggcc^gggtggagctc^ 

tccctac^ggcggggggagagaaggagcaggacccccgcctcatcccggtccctgccaacagctactttggctt 
ctctattgactcggggaaaggtctggtgcgtgcagaagagctgagctttgtggctggagccccccgcgccaacca 

caagggtgctgtggtcatcctg^^ 
cctgacctccggctttggctactcactggctgtggct^^ 

tgccccctacttctttgagcgccaagaagagctggggggtgctgtgtatgtgtacttgaaccaggggggtcactg 

ggctgggatctcccctctccggctctgcggctcccctgactccatgttcgggatcagcctggctgtcctggggga 

cctcaaccaagatggctttccagatattgc^gtgggtgccccctttgatggtgatgggaaagtcttc^tctac^ 

tgggagc^gcctgggggttgtcgccaaaccttcacaggtgctggagggcgaggctgtgggcatc^gagcttcot 

ctactc<x:tgtcaggcagcttggatatggatgggaacc^t^^ 

cgc^gtgctcttcagggccagacccatcctcc^tgtc 

ggagcagcccaactgtgctggcggccactcggtctgt 

cagcagctatagccctactgtggccctggactatgtgttagatgcggacacagaccggaggctccggggccaggt 

tccccgtgtgacgttcctgagccgtaacctggaagaacccaagcaccaggcct cgggcac cgtgtggc tgaagca 

ccagcatgaccgagtctgtggagacgccatgttccagctccaggaaaatgtcaaagacaagcttcgggcc^ 

agtgaccttgtcctacagtctccagacccctcggct 

ccccatcctcaatgcccaccagtccagcac^ 

CAAGATCTGCCAGAGCAATCTGCAGCTGGTCCACGCCCGCTTCTGTACCCXjGGTCAGCG^ 

TCTGCCCMGGATGTGGATGG^C^CAGCC^^ 

GGTCACCAACCTGCCATCGGACCCAGCCC^GCCCCA<^^^ 

CATGCTTCCTGACTCACTGCACTACTCAGGGGTCCGGGCCCTGGACCCTGCGGAGAAGCCACTCTGCCTGTCC^A 
TGAGAATGC CTC CCATGTTGAGTGTGAGCTGGGGAACCCCATGAAGAGAGGTGC C CAGGTCACCTTCTAC CTCAT 
C C T T AGCAC C T C CGGGAT CAGCAT TGAGAC CACG GAAC T GGAGG T AGAGCTGCTG TTGGC CACGATCAGTGAGC A 
GGAGCTGC^TCCAGTCTCTGCACGAGCCC^ 

C CA G C AACTCTT CTTCT CTGGTGTGGTGAGGGGCGAGAGAGCCATGCAGTCTGAGCGGGATGTGGGCAGCAAGGT 
CAAGTATGAGGTCACGGTTTCCAACCAAGGCCAGTCGCTC^ 

GCCTCATGAGATTGCCAATGGGAAGTGGTTGCTGTACCCAATGCAGGTTGAGCTGGAGGGCGGGCAGGGGCCTGG 
GCA GAAAGGG C T TTGCT CTC CCAGG C C CAAC ATC CT C CAC C T G GAT GTGGACAG T AGGGAT AGGAGGCGGCGGGA 
GCTGGAGCCACCTGAGCAGCAGGAGCCTGGTGAGCGGCAGGAGCC CAGCATGTCCT GGTGGCCAGTGT CCTCTGC 
TGAGAAGAAGAAAAACATCACCCTGGACTGCGCCCGGGG<^C<^ 
CAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGC^^ 

TGTGAAGTC C CTGGAAGTGAT TGTC CGGGCCAACAT CACAGTGAAGTCCT CCATAAAGAACTTGATGCT CCGAGA 

TGCCTCCACAGTGATCCCAGTGATGGTATACTTG<^^ 

CATCCTCCTGGCTGTACTGGCTGGGCTGCTGGTGCT^ 

CAAACGGGCGAAGCAC C C CGAGGCCAC CX5TGCCCCAGTACCATGCGGTGAAGATTCCTCGGGAAGACCGACAGCA 
GTT CAAGGAGGAGAAGACGGGCACCAT C CTGAGGAACAACTGGGGCAGCCCCCGGCGGGAGGGCCCGGATGCACA 
CCCCATCCTGGCTGCTGACGGGCATC CCGAGCTGGGCC CCGATGGGCAT C CAGGGC CAGGCACCGCCTAGGTT CC 
CATGT CCCAGC CTGGC CTGTGGCTGC C CTCCAT CC CTTC CC CAGAGATGGCTCCTTGGGATGAAGAGGGTAGAGT 
GGGCTGOTGGTGTCK3CATCAAGATTTGGCAGGATCGGCTTCCTCAGGGGCACA 

TCCTCCCACCCAACTTC C C CTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGACAGGGC CATGGGGTAGGG 
TGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGATCCT^ 

GTAACAGGACC CCAAGG AC CTG C CTCC C CGGAAGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCACTGA 
CTCAGGCTGCTCOreCTCTAGTTTCCCCTC 

TTCGTCTATTTATTAAAAAATATTTGAGAACAAAAAAAAAAAAAAAAAAAA 
FIGURE 177

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA55737
><subunit 1 of 1, 1141 aa, 1 stop
><MW: 124671, pI: 5.82, NX(S/T): 5

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLPGFSVALHRQL 
QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGADMQKESKENQWL 
GVSVRSQGPGGKIVTCAHRYEARQRVDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEG 
RPQGHEQFGFCQQGTAAAFSPDSHYLLFGAPGTYNWKGTARVELCAQGSADLAHLDDGPYEA 
GGEKEQDPRLI PVPANSYFGFS IDSGKGLVRAEELSFVAGAPRANHKGAWILRKDSASRLV 
PEVMLSGERLTSGFGYSIAVADLNSDGWPDLIVGAPYFFERQEEIiGGAVYVYLNQGGHWAGI 
S PLRLCGSPDSMFGI SLAVLGDliNQDGFPD I AVGAPFDGDGKVFI YHGS S LG WAKPSQVLE 
GEAVGIBCS FGYSLSGSLDMDGNQYPDLLVGSIiADTAVLFRARPILHVSHEVSIAPRSIDLEQ 
PNCAGGHSVCVDLRVCFSYIAVPSSYSPTVALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK 
HQASGTWLKHQHDRVCGDAMFQIiQENVKDKLRAIVVTLSYSLQTPRLRRQAPGQGLPPVAP 
ILNAHQPSTQRAEIHFLKQGCGEDKICQSNLQLVHARFCTRVSDTEFQPLPMDVDGTTALFA 
LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN 
ENASHVECELGNPMKRGAQVTFYLI LSTSGI S I ETTELEVELLLATI S EQELHPVS ARARVF 
IELPLSIAGMAIPQQI,FFSGVWGERA^3QSERDVGSK^KYEVTVSNQGQSLRTLGSAFLNIM 
WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE 
RQEPSMSWVIPVSSAEKKKNITLDCARGTANCVVFSCPLYSFDRAAVLHVWGRLWNSTFLEEY 
SAVKSLEVI VRANITVKSS IKNLMLRDASTVIPVMVYLDPMAVVAEGVPWWVI LLAVLAGLL 
VLALLVLLLWKMGFFKRAKHPEATVPQYHAVKIPREDRQQFKEEKTGTILRNNWGSPRREGP 
DAHPI LAADGHPELGPDGHPGPGTA 



Important features:
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 1040-1062 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465,
385-407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466,
379-408 and 1031-1047
FIGURE 178

CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 
AAGC AGCGAGTTGGCAGAG CAGGGCTGCATTT CCAG CAGGAGCTGCGAGCACAGTGCTGGCT 
CACAACAAGATQCTC AAGGTGTCAGCCGTACTGTGTGTGTGTGCAGC CG C TTGGTG CAG TCA 
GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 
TGGATG AT AAACAATGGCT C AC C AC AATCTCTCAGTATGACAAGGAAGTCGGACAGTGGAAC 
AAATTCCGAGACGAAGTAGAGGATG ATTATTT CCG C ACTTGGAGTCC AGGAAAAC CCTT CGA 
TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 
GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 
AAAG AAGCAGG AGTAGACCATAGG CAGTGG AGGGGT CCCATATTATC CACCTGCAAGCAGTG 
CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 
AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGGACATTGC 
C CATGT CCT T CAGAT AAG CCC AC CAGTACAAGC AGAAATGTTAAGAGAGCATGCAGTGACCT 
GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 
GTCAAAACAAGAAGACAAAAACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATC 
TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 
GCTATTGGACCAGTCAGAGCT CAGAAGC ATTTAC CTTGAT AAGAATG AACAGTGT ACCAAGG 
CATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTGCTAC 
TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 
AGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGC 
CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 
ATGGGATC CAG AATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAG ATCTC CGGAGATTT 
TGC TAGTGGC G ATTTTC ATGAATGGACTGATGATG AGGATGATGAAGACGATATT ATGAATG 
ATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGAC 
CATGATGTATACATTTGATTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTA 
CAAAAATGATAGCCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCA 
CATATATTTTGTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAAT 
AAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGCAGTCTA 
GTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 
AAATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAG 
ATAATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 
AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 
GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 
FIGURE 179

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49829
<subunit 1 of 1, 436 aa, 1 stop
<MW: 49429, pI: 4.80, NX(S/T): 0

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNKFR 
DEVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHRMKEA 
GVDHRQWRGPILSTCKQCPWYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
SDKPTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTS I LPI 
CKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISI^EVJCYCFQ 
RQQDPPCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAIDFEISGDFASGDFHEWTDDEDDEDDIMl^EDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern. 

amino acids 246-267 

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 
FIGURE 180

CAGACTCCAGATTTCCCTGTCAACCACGAGG^ 
CTCTTTCACXICCGGGATCGCCCCAGC 

GCTCTGCCTCCGGTGCTGCrGCCTGGGGCGGCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTT 

CCCGCCGGCCAGAAGGAGTGCTTCTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTA 

GATGGAGCAGGATTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGAA 

TCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTAC^ 

TCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAA 

AAATATATTACTGGCACAGATATATTGGATATGAAACTGGAAGACA.TCCTGGAATCCATCAACA 
AGACTAAGCAAAAGTGGGCACATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAACATACAAGAAA^ 
AACTTTGATAGAGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAAGCT 
ATGCTGAAGAGTCTGTTTGAAGATAAGAGGAAAAGTAGAACTTMAACTCC^ 

AATGAGGCAT AAAAATGCAATAAACTGTT ACAGTCAAGACCATTAATGGT CTT CT CCAAAATATTTTGAGAT AT A 

AAAGTAGGAAACAGGTATAATTTTAATGTGAAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAG 

TTGTACTTAAGTGTGTAACAGGAATATTTTGCAGAATATAGGTTTAACTGAATGAAGCCATATTAATAACTGCAT 

TTTCCTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTA 

AGTCTGTTTTTAACAGGTTCTATTACCCAGAACTTTTTTGTAAATGCGGCAGTTACAAATTAACTGTGGAAGT 

TCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGATGGATTGAATAAATCTTTAGACTACAAAAGCCCAA 

CTTTTCTCTATTTACATATGCATCTCTCCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTG 

AGATTTTTATAACCAAATACAT TTCAGTG TAACATATTAG CAGAAAGCATTAGTCTTTGTACTTTGCTTACATTC 

CCAAAAGCTGACATTTTCACGATTCTTAAAAACACAAAGTTACACT 

AAATG AAGAAT AT AGTT TAAAAGCT T CC T CC T C CAT AGGGACACAT TT T CT CTAACCCTTAAC T AAAGTG TAGGA 
TTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGT ATCTGTCAAGTTAAT AT CTGT CAACAGTTAA 
TAATC^TGTTATGTTAATTTTAACATGATTGCTGACTTGGATAAT^ 

TTGCTAAAATGATCTGGGCCTAC(^TAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGG^ 

AGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTTATAAATAAATATCTAGA 

ATC TGAC TGGCT CAT CATGACATC CT AC TCAT AAC ATAAAT CAAAGGAGATGAT T AAT T T C CAGTTAGCTGGAAG 

AAACTTOGGCTGTAGGTTTTTATTTTCTACAAGAATTCTGGTTTGAATTATTTTTGTAAGC^ 

AAATGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTAT 

TAAAATGGCCTTTCTGAACACTTTATTTATTXaATGTTGAAGTAAGGATTAGAAACATAGACT 

CAC CTAAATGTGAATAAC C CAT ATAT ACAACAAAGTTTCTGCCATCTAGCTTTTTGAAGTCTATGGGGGT CTTAC 
TOU^GTACTAGTAATTTAACTTCATCATGAATGAACTATAATTTTTAAGTTATGCCCATTTATAACGTTC 
GACTACATTGTGAGTTAGAAACAAACTTAAAATTTGGGGTATAGAACC C CTCAA CAGGTTAGTAATGCTGGAATT 
CTTGATGAGCAATAATGATAAC CAGAGAGTGATTTCATTTACACT CATAGTAGT ATAAAAAGAGATACATTTC CC 
TCTTAGGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGT 
ATGATCAATTACCTTAATTGGCCAAGAAAATGCTTCAGGTGTCTAGGGGTATCCTCT 

AGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCAAATTTACA 
GTACCATAACTAATAAAGCAGGGTACAGATATAAACTACTGCATCTTTTCTATA^ 

CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTC CTTAC CTAACAATGAATTTGTTACATAAT CTTCT 
ACATGTATGATTTGTGCCACTGATCTTAAACCTATGATTCAGTAACTTCTTAC^ 

TATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAAGTAAAAAGAGAGATATTTAAG 

CAT AAC CAAAAAAGCAAAACTTGTAAACAGAGT AAAAAT CTTTAATATTTCTAAAGACATACTGTTTATC TGCTT 

CATATGCTTTTTTTAATTTCACTATTCCATTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTT^ 

AACAGCTCATTTTGTCTTTTTCAATATACAAATTTTAAAAATACTACAATATTTAACTAAGGCCCA^ 

CATAATGTAGCAGTTACCGTGTTCACCTCAGACTAAGGCCTAGAGTTTGCTCTGA 

GTTATGCTGTTCTTTCATGTGAATGTX^AAGACATGGAGGGTGTTTGTAATTTTATGGTAAAA 

CACATAATGGTGTCTTAAAATTGAGAAAAAATGAGCACTTAGAATTGTATGTCTCCT 

GTGAAATTTTAAAAGACATTGATTCCGCATGTAAGGATTTTTCATCTGAAGTAQ^ATAATGCACAATCAGTGTTG 
CTCAAACTGCTTTATACTTATAAACAGCCATCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATAATAA 
AAATT AT CAAAGGAAAA 



Patent Application Publication Mar. 27, 2003 Sheet 182 of 237 US 2003/0060406 A1

FIGURE 181

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52196

<subunit 1 of 1, 229 aa, 1 stop
<MW: 26017, pI: 4.73, NX(S/T): 0

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEY 
QVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFEL 
ILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRN 
IQESNFDRVNFWSMVNLWMVWSAIQVYMLKSLFEDKRKSRT 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48



Tyrosine kinase phosphorylation site. 

amino acids 55-62 
FIGURE 182

CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAGACAAAGCATACCAGAT 
CTCACCAGAGAGTCGCAGACACTAiSCTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 

T AAC AGCT ACT C ATACAT CTGGATTGGGC TCC ATGACC C C AC ACAGGGCT CTGAGC CTGATG 
GAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGACTAgGGCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTGAGATC 
TCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 183

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56965
<subunit 1 of 1, 175 aa, 1 stop
<MW: 19330, pI: 7.25, NX(S/T): 1

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQK^PSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVTyiNYFAWEKKPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFro 

Important features: 
Signal peptide: 

amino acids 1-26 

C-type lectin domain signature.

amino acids 146-171 
FIGURE 184

CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGAGC 
ACAGAGGAGTGGGCCGGGACCATGCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGC 
TGCCTGCGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGT 
CGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTAC 
TCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAA 
GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 
CTCCCACTCTTGAGCCTCCGACTGTAGAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 
GCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAAAAAAAAAAAA 
FIGURE 185

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56405

<subunit 1 of 1, 125 aa, 1 stop
<MW: 13115, pI: 5.90, NX(S/T): 1

MRGTRLALLALVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCNTELCNVDGAPALNSLHCGALTLLPLLSLRL 
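
The relationship between the nucleotide sequence of FIGURE 184 and the polypeptide of FIGURE 185 can be illustrated with a minimal Python sketch. It assumes the standard genetic code and reading from the first ATG; the function name `translate_from_first_atg` and the short `FRAGMENT` (the start of the open reading frame copied from FIGURE 184, with five upstream bases) are illustrative choices, not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input.

    Whitespace is ignored; codons containing non-ACGT characters become 'X'.
    """
    dna = "".join(dna.split()).upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical excerpt: five upstream bases plus the first 21 codons of FIGURE 184.
FRAGMENT = "GGACCATGCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGCTGCCTGCGGAGAGCTGGCGCCG"
print(translate_from_first_atg(FRAGMENT))  # MRGTRLALLALVLAACGELAP
```

The printed residues agree with the first 21 amino acids listed in FIGURE 185.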

Important features: 
Signal peptide: 

amino acids 1-17 



N-glycosylation site. 

amino acids 46-49
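
As a hedged illustration of how an N-glycosylation site annotation such as the one above can be checked, the following Python sketch scans the FIGURE 185 sequence for the consensus pattern N-{P}-[ST]-{P}. That this particular pattern was used to produce the annotation is an assumption, and the names `SEQUON`, `find_sequons` and `FIG185` are introduced here for illustration only.

```python
import re

# Assumed consensus: Asn, any residue except Pro, Ser or Thr, any residue except Pro.
# The lookahead lets overlapping candidate sites be reported.
SEQUON = re.compile(r"N(?=[^P][ST][^P])")


def find_sequons(protein: str) -> list[tuple[int, int]]:
    """Return 1-based (start, end) positions of each four-residue match."""
    protein = "".join(protein.split()).upper()
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


# Polypeptide of FIGURE 185 (125 amino acids), copied from the listing above.
FIG185 = (
    "MRGTRLALLALVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP"
    "FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCNTELCNVDGAPALNSLHCGALTLLPLLSLRL"
)

print(find_sequons(FIG185))  # [(46, 49)]
```

The single match at residues 46-49 is consistent with both the listed site and the header value NX(S/T): 1.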
FIGURE 186

CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAQACCATTGACA 
ACTAC CAGC CGT AC C CGTGCG C AGAGG ACGAGG AGTGCGGCACTG ATGAGT ACTGCGCT AGT 
CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 

C AT AGC AC CTTGGATGGGTATT C C AGAAGAACC ACCTTGTCTTCAAAAATGTATCACAC CAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 
GACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTC 
TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACC 
TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCA 
AGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATT 
TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 

AATAAAATTTAACATTTAAAAAAAAAAAAA 
FIGURE 187

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57530
<subunit 1 of 1, 266 aa, 1 stop
<MW: 28672, pI: 8.85, NX(S/T): 1

MmLGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSA 
APGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICIiACRKRRKRCMRH 
AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 
SVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQ 

KDHHQASKS SRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site.

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 
FIGURE 188

TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 
ACGGGAAGGTTTTGTGGGACCCAGGTTGAAATGACGGTCATTTTTTTTTCTTTCTCCTTCNG 
GAGTCCTTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 
GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATTACC AGC CGTACC CGTGCGC AGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 
CTGCATG CG TCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 

CATAGCACCTTGGATGGG 
FIGURE 189

GAGGAACCTACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATCCCGTCCGCCGCGG 
rTGGGCCGTCGGAGAGTGCGTGTGCTTCT 

cStgaggatc^agta^acaggaagcgaccccgcgatggcaagg 

AG^ATTAGAAA^GAGCTGAAGACCATT 

TGAAGTAA^CTAGACAGAAGTTCT^AAATTTGCATATTACATC^VA 
TTAAA^CAGAACTTG^TAW 

GMGATTCTCTCTCTTCTGATAGTGTATATGGCCA 
ACTCTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTTTCAA^ 

AZ^CCCGAATAACCCAAATC 

TCTACGGAAAA^GTATGACAAATATGGAGAAAAGGGACTTGAGGA 
CtStTATCGTTATGAT^TTTGGT^ 

tgc^t^aaSct^aactgtggtttgtaaatttttactccccaggctgctcacactgc 

CACATGGAGAGACTTOGCTAAAGAAGTGGATGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGA 

g£™ccgaatga^^ 

======== 

^t^atgtttScag^gtctctagcagtatttaaaggacaaggaaccaaa^ 
gaIg^t?ctm"S^tacttgcctttgccaaagaaag 

TTTTCC1GCCAATGACAAM 

TGAAGGACATCACTCTG^TGAACAAATCTTGGAGTTCATAGAGGATCTTATGAATCCT^ 
ACCCA^CA^CTTCAACGAACTAGTTACACAAAGAAAACACAACGAAGTCTGGATGG 

GTffTCA^CC^GCCAAGTCTTA^^ 

caotmagattg^ 

TCCCCCAAA^TCAAA^AAAGCTTATCAGTATCACAGTTACAATGGTTGGAA^ 

ctg^^gga^™^ 

GAAAAA^CATTGGGTGATTGAT^ 

CTTGC^TAGGATGATTAAAG^SAAAAGTGAAAGCTGGAA^ 

Sc^Satcagggcc^ 

gataaa^acSgagatgcaaaagcaatcgctgc^ 

gaggaatamjgatgaacit^tga^^ 

a^gacaccta^ttagaatgttac^tttatgatgggaatgaatgaa 
Sc^^actgcto^cctcgacggacttataatotttcaggtggctggcttga^ 

ot^aAcA^OTAAACAAACCCTGTTATGCTGTATTATTATGAGGAGAT^^ 
^G^AAAAA^^ 

AAATTTGA*GCAA^AGTAACTGCACAAATTCTGTAGTTTGCTGTATCATCCAGG 

TTTTTCTiCTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAG^TTAT 
AATGA^CTGTAGTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCOT 

c~™ga!tcIcaa£^^ 

ccaatcagtcaaaagaggtcaatgaattaaaaggcttc^ 
FIGURE 190

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56439
<subunit 1 of 1, 747 aa, 1 stop
<MW: 86127, pI: 7.46, NX(S/T): 2

MGVWLNKDD YI RDLKR 1 1 LCFL I VYMAI LVGTDQDF YSLLG VS KTAS SRE I RQAFKKLALKL 
HPDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGI 
YDDDPE 1 1 TLERRJSFDAAVNSGELWFVNFYS PGCSHCHDLAPTWRDFAKEVDGLLRIGAVNC 
GDDRMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESL 

IQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 
TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLK^DHIQVGRFDCSSAPDICSNLYVFQP 
SLAVFKGQGTKEYEIHHGKXILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 
PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 

AEQILEFIEDLMNPSWSLTPTTFNELVTQRKH^^ 

LTGLINVGSIDCQQYHSFC^QENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 
FLPQVSTDLTPQTFSEKVLQGKNHWIDFYAPWCGPCQNFAPEFELLARMIKGKYKAGKVDC 
QAYAQTCQKAGIRAYPTVKFYFYERAKRNFQEEQINT 

Important features: 

Endoplasmic reticulum targeting sequence. 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487






FIGURE 191 

AGACAGTACCTCCTCCCTAGGACTACACAAGGACTGAACCAGAAGGAAGAGGACAGAGCAAA 

GCCATGAACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGA 

GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 

TTACTGGAGCTGGGCATGGAATAGGCAGGCAGACTACTTATGAATTTGCAAAACGACAGAGC 

ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 

ACTAGGCGTCACTGCGCATGCGTATGTGGTAGACTG CAGC AACAGAGAAGAGAT CTATCG CT 

CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 

GT AT AT CC AG CCGATCTTCTCAGCACCAAGGATGAAG AGATTAC CAAGACATTTGAGGTCAA 

CATCCTAGGACATTTTTGGATCACAAAAGCACTTCTTCCATCGATGATGGAGAGAAATCATG 

GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 

TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 

GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA 

AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 

GATGGAAT ACTTAC C AAT AAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTT CTGAG 

ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 

TTGAAGCAGTGGTTGGCCACAAAATCAAAATGAAATGAATAAATAAGCTCCAGCCAGAGATG 

TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 

TCAGTCCTGATAATATTAAAAACATTGGTTTGGCACTAGCAGCAGTCAAACGAACAAGATTA 

ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 

TCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTTTTCTTTAAGATAT 

TTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 

TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 

AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 

ATCTCAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 

TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 

AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 

TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 

TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 

TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 

CAATGGACCCAAGAGAAGAA 
FIGURE 192

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56409
<subunit 1 of 1, 300 aa, 1 stop
<MW: 33655, pI: 9.31, NX(S/T): 1

MNIILEILLLLITIIYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQSI 

LVUWD INKRGVEETAAECRKLGVTAHAYVVDC SNREE I YRSLNQVKKEVGDVT I WNNAGTV 
YPADLLSTKDEEITKTFEVNIL.GHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYC 
SSKFAAVGFHRGLTSELQALGKTGIKTSCLCPVFVNTGFTKNPSTRLWPVLETDEVVRSLID 

G I LTNKKMI FVPSY INI FLRLQKFLPERASAI LNRMQN I Q FEAWGHKI KMK 

Important features:
Signal peptide: 

amino acids 1-19 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 

amino acids 165-202, 37-49, 112-122 and 210-219 
FIGURE 193

CGGCGGCGGCTGCGGGCGCGAC3GTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 

ATGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 

AGGATGACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 

GCTGATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 

CTAC3GCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 

GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 

CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGeCGGGGAGCATGGAGGAGAGCGTGAGAG 

GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 

CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 

CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 

CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 

AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 

CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 

CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 

TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 

CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 

GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 

CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 

CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 

AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 

ACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 

GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 

GAGACIGAAAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 

AGTTTTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 

ATTGAGTACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA 
FIGURE 194

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56112
<subunit 1 of 1, 414 aa, 1 stop
<MW: 48414, pI: 9.54, NX(S/T): 4

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RSVLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLS 
GSLLHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKyTKFLFVRDPF 
VRLISAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 

ASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD 

Important features: 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein 

amino acids 329-332 
FIGURE 195

TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAGGTGA 

AAGAGGCCCAGAGTAGAGAGAGAGAGAGACCGACGTACACGGGATGGCTACGGGAACGCGCT 

ATGCCGGGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGC 

GCCTTCGTGAACAGCGGGGCCCGAGTGGTTATCTGCGACAAGGATGAGTCTGGGGGCCGGGC 

CCTGGAGCAGGAGCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATG 

TGAAGACCCTGGTTTCTGAGACCATCCGCCGATTTGGCCGCCTGGATTGTGTTGTCAACAAC 

GCTGGCCACCACCCACCCCCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCT 

GCTGGAGCTGAACCTACTGGGGACGTACACCTTGACCAAGCTCGCCCTCCCCTACCTGCGGA 

AGAGTCAAGGGAATGTCATCAACATCTCCAGCCTGGTGGGGGCAATCGGCCAGGCCCAGGCA 

GTTCCCTATGTGGCCACCAAGGGGGCAGTAACAGCCATGACCAAAGCTTTGGCCCTGGATGA 

AAGTCCATATGGTGTCCGAGTCAACTGTATCTCCCCAGGAAACATCTGGACCCCGCTGTGGG 

AGGAGCTGGCAGCCTTAATGCCAGACCCTAGGGCCACAATCCGAGAGGGCATGCTGGCCCAG 

CCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGGGCTGCGGCAGTGTTCCTGGCCTCCGA 

AGCCAACTTCTGCACGGGCATTGAACTGGTCGTGACGGGGGGTGCAGAGCTGGGGTACGGGT 

GCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCCCTTCC2S&TTTCTCTCATTT 

CTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAACCTGTATCAGATGCAGC 

CCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACCCTGCAGGTTCCCAT 

AAAAACGATTTGCAGCC 
FIGURE 196

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56045

<subunit 1 of 1, 270 aa, 1 stop
<MW: 28317, pI: 6.00, NX(S/T): 1

MATGTRYAGKWWTGGGRGIGAGIVRAFVNSGARWICDKDESGGRALEQELPGAVFILCD 
VTQEDDVKTLVSETIRRFGRLDCWNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 

ALP YLRKS QGNVI N I S S LVGAI GQAQ AVP YVATKGAVT AMTKALALDES P YGVRVNC T S PGN 
IWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 

AELGYGCKASRSTPVDAPD I PS 

Important features: 
N-glycosylation site. 

amino acids 138-141

Short-chain alcohol dehydrogenase family protein 
amino acids 10-22, 81-91, 134-171 and 176-185 
FIGURE 197

AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAATGGACTGGCCTCACAA.CCTG 
CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAA 
GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 
TGGTGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATG 
GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 
GTGGATGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 
GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 
ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 
CCTCTGCCCGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 
CTGTGGGCTGCACCTGCATCTTC1GAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 
CCATCCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 

GCAAG 
FIGURE 198

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA59294

<subunit 1 of 1, 180 aa, 1 stop
<MW: 20437, pI: 9.58, NX(S/T): 1

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNIEEKVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYS INHDPSRI PVDLPEARCLCL 
G CVNP FTMQ EDR SMVS VPVF S QVPVRRRL CP PP PRTGP CRQRAVMET I AVGCTC I F 

Important features: 
Signal peptide: 

amino acids 1-20

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-17 

amino acids 96-180. 
FIGURE 199

GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 

CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 

GCGCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 

CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 

TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 

TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 

TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 

GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 

TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATC 

AGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 

TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 

CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 

TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 

GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 

GAGGAAAAAGATGATTCAAATGAAGAAGAAAACAAAGACAGCCTTGTAGATGATGAAGAAGA 

GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 

CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 

ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 

TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGAC 

TGTAGATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 

CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 

CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 

GACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 

CAAGTTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATC 

AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCT 

CCTGAGCATCCCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 

TTTTGTCAAATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 

ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 

TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 

CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 

TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 

GACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAAAAGGAAGTAAGGATT 

TTACAAGACAGATTAAAAAAAAATTGTTTTGTCCAAAATATAGTTGTTGTTGATTTTTTTTT 

AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 

CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 

TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 

GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 

TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 

GCTAATTTCTTT 
FIGURE 200

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56433

<subunit 1 of 1, 349 aa, 1 stop
<MW: 38952, pI: 4.34, NX(S/T): 1

MAGGRCG PQLT ALLAAW I AAVAATAGP EEAAL P P E Q S RVQ PMTASNWTLVMEGE WMLKFYAP 
WC PS CQQTDS EWE AFAKNGE I LQ I S VGKVDVI QE PGLSGRFF VTTLPAFFHAKDG I FRRYRG 
PGIFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFT^ 
CSYVFFVIATLVFGIiFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 
DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTRE 

E VE PEEAEEG I S EQP CPADTE WEDSLRQRKSQHADKGL 

Important features: 
Signal peptide: 
amino acids 1-22 



Transmembrane domain:

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins (homologous region to disulfide isomerase).

amino acids 56-72



Flavodoxin proteins 

amino acids 173-187 
FIGURE 201

gaScctatgc^^^ 

AG^CCTTGTCCTCCTAATTGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACT 

^SStgggatcmaagtaaaagacaatgctggcaactgggctctaatctcctj^^^ 

a^gScaaatcaattgggatttatagtaaaacaagtgtcttcccaacctgat^ 

tSatag^tttctcggccccagactgaggtaccatggctctgggaggatggatc^ 

CTCTTCTAACTTATTTCAGATCAGAACCACAGCTACCCAAGAAAACCCATCTCCAAAT^ 

Stggat^cgtct^gtcatttatgaccaactgtgtagtgtgccctcatatagtatttgt 

GAGAAGAAGTTTTCAATGTAAGAGGAAGGGTGGAGAAGGAGAGAGAAATATGTGAGGTAGTA 

aggaggaSaa^ 

gtatttgtcgacctacaggtaggctagtattatttttctagttagtagatccctagacatgg 
SSgggcaI^ 

cacStgttacccaggctggagtgcagtggcacaatctcgactcactgcagctatctctcgc 
ScagccStcS^ 

ttttttgtagagactgggttttgccatgttgaccaagctggtctctaactcctgggctt 
™c?gcc?gcct^ 

cccaagcttgaattttcattctgccattgacttggcatttaccttgggtaagccataagcga 
Sc^tJtctggctctatcaga^ 

otS™cacgaStgaccctcaacttctagcagtatatcagttatgaactga^^ 
Satttctcaatagctaaatgaagaaatgggaaaaaatc 

ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAA 

??tt??cStatcaattaaatagctcagagagtacatctgccatatctctaatagaatctt^ 
ttttttttttttttttt^^gj^acagagtttcgctcttgttgcccaggctggagtgcaacgg 

ScgSc^ggctcaccgcaacctccgccccctgggttcaagcaattctccto^ 

cccaagtagctgggattacagtcaggcaccaccacacccggctaattttgtatttttttagt 

agagacagggtttctccatgtcggtcagggtagtcccgaactcctgacctcaagtgatctgc 

Jtccc?cgg?ctcccaagtgctgggattacag^ 
aaS?a^ataaaaatacataaaatctgatgatgaatataaaaa^ 

ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 

acc1?St^cagtaattactgtaaaatggtattattggaa TO 

TGCTGATTTGTCTTATTTTTTTGATACTTTCCCACTGGTGCTATTTTTATTTCCAATGGAT 

l^TACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTTATGTGGATTTCAT 
TTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 202

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53912

<subunit 1 of 1, 201 aa, 1 stop
<MW: 22563, pI: 4.87, NX(S/T): 1

MEYHPDLENLDEDGYTQLHFDSQSNTRIAWSEKGSCAASPPWRLIAVILGILCLVILVIAV 
VLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 
VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL 

CSVPSYSICEKKFSM 

Important features: 

Type II transmembrane domain: 

amino acids 45-65 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids 34-67 and 70-200. 
FIGURE 203

gga.ggggaggagcaggcca^^^ 

CACAGGCTGGAGTGCACTGCTG^ 

TATGGCAAGCGTGACTTCrTGCTGAG^CAAAGCCTCTAG^ 
GCTCAGGGCCCCCXGCTGTTAGCCACWCTGTCACCTCCTGG^AG 

GAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGO 

AGCCAAGTCCTGGGTGAGAAGGTCTTGG^ 
GTGCTCACTTTCCAGCACCAG^TACAGCCGAAG^TGT^ 

GTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA^ 

AAGTGGTCACATGTGCTGACACTCCTGGG^ 
TTTGCTTCTGGCACCTTCCAGOTGTCGTCCTCTACCTT^ 

TTCATCTCGTACTGGTCCATGCGGCTGCA^^ 
AGCTCGCCTACCTCTGAGCCCAGGCCCCCTCC^ 

=^^^^= 

TAATTCTGTCCAACWttCACACft^ 
FIGURE 204

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50921
<subunit 1 of 1, 693 aa, 1 stop
<MW: 77738, pI: 8.87, NX(S/T): 7

MTPQSLLQTTLFLLSIiLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLIATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQD 
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 
KVIX3IWQNTKVANLTEPWLTFQHQLQPKNVTLQCVFWVEDPTLSSPGHWSSAGCETVRRE 
TQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSAIiACLVTIAAYLCSRVPLPC 

RRKPRDYT I KVHMNLLLiAVF LLDT S FLLS E PVALTGS E AGCRAS AI FLHFSLliTCLSVJMGLE 
GYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIY 
PSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLG 
LPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 

ISSGSTSSSRI 

Important features: 

Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal. 

amino acids 691-693 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 
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FIGURE 205

TGCCTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 
GGAAANCNTCGGGACTACACCNTCAAGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTG 
GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 
TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 
GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 
TGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 
TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 

GGTGTTTCTGTTCAACATGG 



Patent Application Publication Mar. 27, 2003 Sheet 207 of 237 US 2003/0060406 A1



FIGURE 206

cggacgcgtgggcggacgcgtgggc^ 

GTGGGAACTGTGCCTGTTGTCTCC^^TOG^TG^TGTC^l^^ CGTGGCCCTGGAGC 
ACGAGGTCCTTC^GTTGAGACCARAGACCGGTGTCAGGGGATTGCAC 



ATGTAAACCAGAACATTCTATGTACTACAAACCTG^ 
GTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 

TTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCT lAit- 1 ^.aTp-^rGCTAAATATATCT 
TTTCATTGTGTACATTTTTATATTrcTCCTTTTGACA 
ATTTTTACCAAAGGTATTTAATATTCTTTT 



AAAGCACCTTGAAARAGACTTGGCAGCTTCCTGATAAAGC^GTGCTGTGC 

TTGTGATOTGTGGTTTTAT^CTT^ 
GAAGTATGTCTCTTAACC^GTTCACTTAJTC^ 

ATGTGGC^StTTTGG^GAGAAwS^ 
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FIGURE 207

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPR 

FPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTILGRWC
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLiNLiLjTEEVRLY 
SCTPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSKVTKKYHEVLQ 

LRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG



Signal sequence: 

amino acids 1-14 
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FIGURE 208

AAACTCCTTGG^CTAGATGGTTTC^^ 



MCGTGGRCTGCACAGACAAGCATTTGACAGAAM 

TGTAAAAATAATTCTCCCCTACAGATCCCTGTAA^ 

CACAGTAACTCTCTTCAGCATGTGCCCCCAAGATGGT^ 

CAAAACTTCTTGGCCAAAGAAATTGGGGATGCTAAAT 

CTGAAAGTCATAGATCTTTCAGTGAATAAARTATCACCTTCAGGAGAT 
GCCAGAACTTCTGTAGAAAGTTATGAA.CCCCAGGTCCTOGAAC 

CTGCAGAAACTGATGATGAACGACAATC 
TTCTGT;^GGCCAAGATAAAG<3G<nATCA^ 




FIGURE 209



MVFPMWTLKRQILILFNIII.ISKI.LGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGG 
IPTNTTNLTIjTINHIPDISPASFHRIjDHLVEIDFRCNCVPIPIiGSKNNMCIKRLQIKPRSFS 

GLTYLI 1 



jKSLyiiDGNQIiLEIPQGLPPSLQIjLSLBANNIFSIRKKNLTEIiANIEILYLGQNCYyR 



NPCYVSYSIEKDAFLNLTKIjKVIjSLKDNNVTAVPTVLPSTIjTELYIjYNNMIAKIQEDDFNNL 
NQLQILDLSGNCPRCYNAPFPCAPCKNNSPLQIPVNAFDAIiTELKVLRLiHSNSLQHVPPRWF 
KNINKLQELDLSQNFLAKEIGDAKFIiHFLPSLIQLDLSFNFEIjQVYRASMNLSQAFSSLKSL 
KILRIRGYVFKELKSFNLSPLHNLQNLEVLDIjGTNFIKIANLSMFKQFKRLKVIDLSVNKIS 

psgdssevgfcsnartsvesyepqvleqlhyfrydkyarscrfknkeasfmswescykygq 

TLDLSKNS I FFVKS SDFQHLSFLKCLNLSGNLI SQTIjNGSEFQPIAELRYLDFSNNRU > 1 j IjH 
STAFEELHKLE VLD I sSNSHYFQSEGITHMLNFTKNLKVLQKIjMMNDND I s s stsrtmes es 

lrtlefrgnhldvlwregdnrylqlfknllkleeldisknslsflpsgvfdgmppnlknlsl 

AKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSIjKNLILKNNQIRSIiTKYFLQ 
DAFQLRYLDLSSNKIQMIQKTSFPENVI^LKMbLLHHNRFLCTCDAVWFWWVNHTEVTIP 
YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNIiILFSLSISVSLFIiMVMMTASHIjYFWDVW 
YI YHFCKAKI KGYQRLI SPDCC YDAFIVYDTKDPAVTEWVIiAELVAKLEDPREKHFNIjCLEE 
RBWLPGQPVLENLSQSIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 

kpfqkskflqlrkrlcgssvlewptnpqahpyfwqclknalatdnhvaysqvfketv 

Signal sequence: 

amino acids 1-26 



Transmembrane domain:

amino acids 840-860 
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FIGURE 210

CCGGTATACAATCAAATGGCTTGAATATCAC^ 

TTAAAAATATGCCTCATCTGflAGGTGCTGGATC^^ 
TTTCAACGATGCTGCCCCGCTTACMU^A™ 
TTAATATTTCC^GA^CTTCTCTAAACT^^ 
TCAGftGAAGATGATTTCCAGCCOCTGATOCA^CrrcCAARCrr 

AGCAAATCGATTTCAAACTTTTCCAA^^ 

CGTTGGTAAAAGATACCCGGCAGAGTTATGCAAATAGTTCCTCTTOIC^^^ ffrGCrIGCrr 
CAGATTTTGAGTTTGACCCACATTCGAACTTTTATX^Tj^C^ 

ATTTCACAJ^TCTAAAAGTTTTAA&CTTGAGCC^ 

TATCTGACTTTACATCTTCCCTTCGGACAC^^ 

CTTACATTTCTTATGAC^CC^RAGATCCCTCTCTTACTCACTGG^T^AT 
ACCCGAAGGCAGAAGGCTTGTTTTGGCAAACT^TGAGAAATGTGGTCTTGA^ 



TCTGCTGGCCCACAGTTTTTGAGGGTCAGGAGTCCAGGC 
AGGCTGCAATGTAGGTGTTC^CCAGAGA^TA^^ 

ATTCCTCCTGGGCTATTGGCCAAAGGCTATACT^TGTAAGCCATGCGAGCCT 
ATCAGAGCTAGCAAAAAAGAGAGGTTGCTAGCAAffl 



TTGTTAATTGCCATTGCTGTAAAT < 
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FIGURE 211 

MENMFLQSSMLTCIFLLtSGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG 

KYVTELDLSDNFITHITNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNL 

KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV 

CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 

TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM 

PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL 

HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 

KDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPIjIKPQCAAYGKAIjDIjSIiNSIFFI 

GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 

LDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 

DILWNDDDNRYISIFKGLKNLiTRIiDI»SIiNRLKHIPNEAFLNIjPASLTEIiHINDNMIiKFFNWT 

LLQQFPRLELLDLRGNKLLFL.TDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSI.KHLDLS 

SNLIiKTINKSALETKTTTKLiSMIiELHGNPFECTCDIGDFRRWMDEHIjNVKI PRLVDVICASP 
GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMLAALAHHLFYWDWFIYNVCLAKVK 
GYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 
NLMQSINQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVIIFILLEPVLQHSQYLRL 
RQRICKSSILQWPDNPKAEGLFWQTLRNVVLTENDSRYNNMYVDSIKQY 



Signal sequence: 

amino acids 1-26 



Transmembrane domain:

amino acids 826-848 
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FIGURE 212

CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 

CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 

CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 

AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 

TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 

GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 

GGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 

CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 

CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 

CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 

CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 

CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 

GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 

GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTCCATCAACACCGCCGG 

CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 

CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 

GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 

CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 

TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 

GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTG 

GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 

AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 

TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 

CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 

AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 

GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 

CGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAAT 
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FIGURE 213 

MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG

RIDSLSEQISFLEEQLGSCSCKKDS
Signal sequence: 
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FIGURE 214 

GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAG
GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AGCAGCATCAGAGCAGCCCCTGTGGTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGA 
GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGC 

AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC

AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCCCCCAGGACCGGGGAGG 

CACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC 

TCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGC 

TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 

CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCG 

TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 

CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 

CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 

GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 

GGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTG 

TCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 

TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCG 

ACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCT 

GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 

ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 

TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 

CTCGTGACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 

CCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 

AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 

GGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 

TCCCAAGGCCAGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGAC 

CCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 

CCCCAGCACAATAAAAATGAAACGTG 
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FIGURE 215 

MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 

RIDSLSEQISFLEEQLGSCSCKKDS



Signal sequence: 

1-19 
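
The correspondence between a nucleotide figure and the polypeptide figure that follows it can be spot-checked by translating the open reading frame. The following is a minimal sketch, not a method described in the source. It assumes the standard genetic code and assumes that the Figure 215 polypeptide is read from the ATG immediately preceding AGGGGC in Figure 214. The function name `translate` and the use of "X" for unreadable codons are illustrative choices.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T (for example the
    N placeholders in some figures) are reported as 'X'.
    """
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":  # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# Start of the assumed open reading frame, copied from Figure 214.
fragment = "ATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT"
print(translate(fragment))  # MRGSQEVLLMWL
```

The printed residues match the first twelve residues shown in Figure 215. The same check can be applied to other nucleotide and polypeptide pairs, provided the reading frame is chosen by hand.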
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FIGURE 216

CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA

GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 

TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 

CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 

CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 

CTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC

AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 

GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 

GGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 

GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 

CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAG 

CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA 

ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 

GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 

CCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 

GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 

GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 

TTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCT 

GGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 

TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCG 

GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 

GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 

CCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGGTAC 

GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG

CCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTG 
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FIGURE 217

MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCVNTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 

RIDSLSEQISFLEEQLGSCSCKKDS



Signal sequence:

1-19 
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FIGURE 218



■■■■■■I 

^CTTTTGTTTGTGTT 
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FIGURE 219 

MSVMVVRKKVTRKWEKLPGRNTFCCDGRVMMARQKGIFYLTLFLILGTCTLFFAFECRYIAV 
QLSPAIPVFAAMLFLFSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPPP 
RIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYL 
FILSLSLLTIYVFAFKIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTF 
LVALNQTTNEDIKGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSR 
PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 

Putative transmembrane domains:

amino acids 36-55 (type II TM), 65-84, 188-208, 229-245
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FIGURE 220

AAAACCCTGTATTTTTTACAATGCAAATAGRCAATNANCCTGGAGGTCTTTGAATTAGGTAT 
TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 
CCCACAGAGCNCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTA 
CCGCTANTTCTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 



ACATCGT 
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FIGURE 221

GTTGTGTCCTTCAGCJ^AAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 

TCTATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGA 

AAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTCAC

GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 

CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 

GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 

CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 

AGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 

CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 

TTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 

CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 

GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 

CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 

TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 

TCAGCAGTCCCCTCAGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTGAAGGAAA 

GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 

AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 

ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 

CTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTTTGATGTGAGTGCC 

ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 

CGACAGCAACCAATCAGATATATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGA 

AATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAA 

AATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACX3GGAAGAACACAGC 

ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 

GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 

AATTCAATCAGTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 

GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 

AAAAA 
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FIGURE 222 

MKTIQPKMHNSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNR
VTRVAWLNRSTILYAGNDKWCLDPRVVLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPK
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL
EIQGITREQSGDYECSASNDVAAPVVRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAV
PSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIML
FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF



Signal peptide: 

amino acids 1-28 
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FIGURE 223

GAAAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTC 
ACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT 
CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 
TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT 
GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 
CGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 
ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 
TCTTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAG 

ACCAGAG 
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FIGURE 224

atcaccgcch^cco^ctccac^ 

AAGGGGACAAGACAGCTG^AGGCTCACGCACGCAGCTGGAGCTGG 
GCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACC 

CAGTTCTCCTGTGK3GGGCTGGATrC«»GGAA^^ 
CTCTGGGACCAAAACCAGGC^TACT^^ 
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FIGURE 225 

MNVALQELGAGSNVGFQKGTRQLLGSRTQLELVIAGASLLLAALLLGCLVALGVQYHRDPSH 

STCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 

ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN

FMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEKVLTAY 

LDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP

SMDWLEFLSFLLSPLELSDSEPWVYGMDYLQQVSELINRTEPSILNNYLIWNLVQKTTSSL 

DRRFESAQEKLLETIiYGTKKSCVPRWQTCISNTDDALGFALGSLFVKATFDRQSKEIAEGMI 

SEIRTAFEEALGQLVWMDEKTRQAAKEKADAIYDMIGFPDFILEPKELDDVYDGYEISEDSF 

FQNML^LYNFSAKVMADQLRKPPSRDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNH 

PKALNFGGIGVVMGHELTHAFDDQGREYDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQV

NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCS 

VRTPESSHEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 



Type II Transmembrane domain: 

amino acids 32-57 
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FIGURE 226 

GCCCGGCCCTCCCKrCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCOCGCGCCCTCCTCCCTCCCTCCTCCC 

CAGCTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCGCTGCTGCTCCTCGGGCTGCTGCTGCT 

CGGCTCCCGGCCGGCCCGCGGCGCCGGCCCAGAGCSXCCCGTGCrGCCCATCCGTTCTGAGAAGGAGCCGCTGCC 

rGTTCGGGGAGCGGCAGGTAGGTGGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGC 

CAGCCCGGAGGGGGCGCGGGGCGCAGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAGGGC 

GCGGTGCCTGGGACCCGGGACCCGCGGGCAGCCCCCGGGGCGGCACACGGCGCGAGCTGGGCAGCG^CTCCAGC 

CAAGCCCGTCCCCGGAGGCTGCACCTTCGGCGGGAAGGTCTATGCCTTGGACGAGACGTGGCACCCGGACCraGG 

G^GCCATTCGGGGTGATGCGCTGCGTGCTGTGCGCCTGCGAGGCX3CAGTGGGGTCGCCGTACCAGGGGCCCTGG 

CAGGGTCAGCTG^SvGAACATCAAACCAGAGTGCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCTGCCGGGACA 

CTGCTGCCAGACCTGCCCCCAGGACTTCGTGGCGCTGCTGACAGGGCCGAGGTCGCAGGCGGTGGCACGAGCCCG 

AGTCTCGCTGCTGCX3CTCTAGCCTCCGCTTCTCTATCTCCTAC^GGCGGCTGGACCGCCCTACCAG(^TCCGCTT 

CTCAGACTCCAATGGCAGTGTCCTGTTTGAGCACCCTGCAGCCCCCACCCAAGATGGCCTGGTCTGTGGGGTCTG 

GCGGGCAGTGCCTCGGTTGTCTCTGCGGCTCCTTAGGGCAGAACAGCTGCATGTGGCACTTGTGACACTCACTCA 

CCCTTCAGGGGAGGTCTGGGGGCCTCTCATCCGGCACCGGGCCCTGTCCCCAGAGACCTTCAGTGCCATCCTGAC 

TCTAGAAGGCCCCCACCAGCAGGGCGTAGGGGGCATCACCCTGCTCACTCTCAGTGACACAGAGGACTCCTT 

TTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGTTCCCTTGAGGCTCCAGATTCTACACCAGGGGCA 

GCTACTGCGAGAACTTCAGGCCAATGTCTCAGCCCAGGARCCAGGCTCTGCTGAGGTGCTGCCCAACCTGACAGT 

CCAGGAGATGGACTGGCTGGTGCTGGGGGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCAGGGCTGCGCAT 

CAGTGGACACATTGCTGCC^GGAAGAGCTGCGACGTCCTGCAAAGTGTCCTTTGTGGGGCTAATGCCCTGATCCC 

AGTCCAAACGGGTGCTGCCGGCTCAGCC^GCCTCACTCTGCTAGGAAATGGCNCCCTGATC 

GGTAGGGACAACCAGTGAGGTGGTGGCCATGACACTGGAAACCAAGCCT 

CTGCX^l^TGGCTGGCCTATCCTCCCCTGC 

TCATATGCTGCTGCAGAATGAGCTCTTCCTGAACGTGGGCACCAAGGACTTCCCAGACGGAGAGCTTCGGGGGCA 
ACGTGGCTGCCCTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGTGCTACC 
CCCT^TGAAGAGCCAAGCAGCAGGGCACGCCTGGCTTTCCTTGGATAC^ 

GC^CTGGGCT^TGGCTCAGAACAAGGCACTGTCACTGCCCACCTCCTTGGGCCTCCTGGAACGCCA 

TCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAGGCCCAGGGTGTGGTGAAGGACCTGGAGCCGG^ 

GCACCTGGCAAAAGGCATGGCTTCCCTGATGATCACCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGCAGCCT 

CTCCTCCCAGGTGCACATAGGGAACCAATGTGAGGTTGGCGGACrrGCGCCTGGAGGCGGCCGGGGCCGAGGGGGT 

GCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCTCCCGGCCCTAGCGCCCGC 

CAAACCTGGTGGTCCTGGGCGGCCCCGAGACCCC^ACACATGCTTCTTCGiAGXMGCAGCAGCGCCCCCACGGGGC 

TCGCTGGGCGCCCAACTACC^CCCGCTCTGGTCACTCTGCACCTGCCAGAGACGAACGGTGATCTGTGACCC^ 

GGTGTGCCCACCGCCCAGCTGCCCACACCCGGTGCAGGCTCCCGACCAGTGCTGCCCTGTTTGCCCTGGCTGCTA 

TTTTGATGGTGACCGGAGCTGGCGGGCAGCGGGTACGCGGTGGC^CCCCGTTGTGCCCCCCTTTGGCTTAATTAA 

GTGTGCTGTCTGCACCTGCAAGCAGGGGGGCACTGGAGAGGTGCACTGTGAX3AAGGTGCAGTGTCCCC 

CTGTGCCCAGCCTGTGCGTGTCAACCCCACCGACTGCTGCAAACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGG 

GGACCCCATGCAGGCTGATGGGCCCCGGGGCTGCCGTTTTGCTGGGCAGTGGTTCCCAGAGAGTCAGAGCTGGCA 

CCCCTCAGTGCCCCCGTTTGGAGAGATGAGCTGTATCACCTGCAGATGTGGGGTAAGTGGGGAGCAGAGGCTTGT 

GTGAGGTGGGTACTGGGAGCCTGGTCTGGAGTAGGGAGACCTTCCCAGGGAGGTCCCTGAAGAAGCTGAAGGTCA 

CTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCTCTGTCTTGTACCAGGCAGGGGTGCCTCACTGTGAGC 

GGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGGAAGGAGAGTCGATGCTGTTCCCGCTGCACGGCCCACC 

GGC^CGTAAGTGAGGGAGTCCAGGGTCAGCAGCTGTGAGTGGAGGGCTCACCTGCCTGTGGGACTCCTGATCAG 

ggaagIgagISc^ct^ 

ACCTGGTGGAATTGTTATTTATGACCTTTTCTTTACAAATGAGATTTCTGAAGCTCAGAGAAATTAAGCAACGAG 

ATGAAGGTCACCCAGCTGTGTGCACTGACCTGTTTAGAAAATACTGGCCTTTCTGGGACCAAGGCAGGGATGC^ 

TGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCACTCCCTCTCCCCTCCTCCAACATTCC 

AGCAGCCCCaGAGACCAGAACTGATCCAGAGCTGGAGAAAGAAGCCGAAGGCTCTTAGGGAGCAGCCAGAGGG 

^CTGACCAAGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGGGAAG 

CCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAACCACAGCTCCACAAGGGG 

GAGAGGCAGCTGGGCCAGACCGAGGTCACAGCCACTCCAAGTCCTGCCCTGCOVCCCTCGGC^ 

GCCCCACCCCTTTCTTCCTGTACATAATGTCACTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACCAAG 

GGCCCCGGACACTCCACTCCTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAAC 

AmCTTOT^GTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGGAGAAGGG 

GCNGAGAGTAGGAGGTGAGAGAGAGGAGCTCTGA(^CTTGGGGAGCTGAAAGAGACCTGGAGAGGCAGAGGATAG 

CGTGGCNNTTGGCTGGCATNCCTGGGTTCCGCAGAGGGGCTGGGGATGGTTC 

AATTTAGGGAAGTAGAAGCAGGATTTTGACTCAAGTTTAGTrrCCCACATCGCTGGCCTGTTTGCTGAOT 
mGAAGTTGCrCCAGAGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCTTTCT 

TTCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCC 
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GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 

CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 

TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG 

GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC 

TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 

TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 

GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 

GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 

CAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 

GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 

AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 

GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 

CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 

GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 

ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 

TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 

CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 

TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 

GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 

AGATAGGCTGATAACAAAGCTCACAAGAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 

AGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 
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ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 

TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 

TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 

CCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 

TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 

TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 

GGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 

GCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 

CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 

ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 

TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 

TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 

GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 

GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 

GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT 

CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 

TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 

TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 

CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 

TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 

GTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGAAAGCAGATA 

GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 

TGGCCCTTATAGGACC 
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CCCACGCGTCCGAGCCGCCCGAGAATTAGACACACTCCGGACGCGGCCAAAAGCAACCGAGA 
GGAGGGGAGGCAAAAACACCGAAAAACAAAAAGAGAGAAACAACACCCAACAACTGGGGTGG 
GGGGAAGAAAGAAAGAAAAGAAACCCACCCACCCACCAAAAAAAAA^ 

AAAAAAAAAAAATCCTGTGGCGCGCCGCCTGGTTCCCGGGAAGACTCGCCAGCACCAGGGGG

TGGGGGAGTGCGAGCTGAAAGCTGCTGGAGAGTGAGCAGCCCTAGCAGGGATGGACATGATG 

CTGTTGGTGCAGGGTGCTTGTTGCTCGAACCAGTGGCTGGCGGCGGTGCTCCTCAGCCTGTG 

CTGCCTGCTACCCTCCTGCCTCCCGGCTGGACAGAGTGTGGACTTCCCCTGGGCGGCCGTGG 

ACAACATGATGGTCAGAAAAGGGGACACGGCGGTGCTTAGGTGTTATTTGGAAGATGGAGCT

TCAAAGGGTGCCTGGCTGAACCGGTCAAGTATTATTTTTGCGGGAGGTGATAAGTGGTCAGT 

GGATCCTCGAGTTTCAATTTCAACATTGAATAAAAGGGACTACAGCCTCCAGATACAGAATG 

TAGATGTGACAGATGATGGCCCATACACGTGTTCTGTTCAGACTCAACATACACCCAGAACA 

ATGCAGGTGCATCTAACTGTGCAAGTTCCTCCTAAGATATATGACATCTCAAATGATATGAC

CGTCAATGAAGGAACCAACGTCACTCTTACTTGTTTGGCCACTGGGAAACCAGAGCCTTCCA 

TTTCTTGGCGACACATCTCCCCATCAGCAAAACCATTTGAAAATGGACAATATTTGGACATT 

TATGGAATTACAAGGGACCAGGCTGGGGAATATGAATGCAGTGCGGAAAATGCTGTGTCATT

CCCAGATGTGAGGAAAGTAAAAGTTGTTGTCAACTTTGCTCCTACTATTCAGGAAATTAAAT 

CTGGCACCGTGACCCCCGGACGCAGTGGCCTGATAAGATGTGAAGGTGGAGGTGTGCCGCCT 

CCAGCCTTTGAATGGTACAAAGGAGAGAAGAAGCTCTTCAATGGCCAACAAGGAATTATTAT 

TCAAAATTTTAGCACAAGATCCATTCTCACTGTTACCAACGTGACACAGGAGCACTTCGGCA 

ATTATACCTGTGTGGCTGCCAACAAGCTAGGCACAACCAATGCGAGCCTGCCTCTTAACCCT 

CCAAGTACAGCCCAGTATGGAATTACCGGGAGCGCTGATGTTCTTTTCTCCTGCTGGTACCT 

TGTGTTGACACTGTCCTCTTTCACCAGCATATTCTACCTGAAGAATGCCATTCTACAASA^A 

TTCAAAGACCCATAAAAGGCTTTTAAGGATTCTCTGAAAGTGCTGATGGCTGGATCCAATCT

GGTACAGTTTGTTAAAAGCAGCGTGGGATATAATCAGCAGTGCTTACATGGGGATGATCGCC 

TTCTGTAGAATTGCTCATTATGTAAATACTTTAATTCTACTCTTTTTTGATTAGCTACATTA 

CCTTGTGAAGCAGTACACATTGTCCTTTTTTTAAGACGTGAAAGCTCTGAAATTACTTTTAG 

AGGATATTAATTGTGATTTCATGTTTGTAATCTACAACTTTTCAAAAGCATTCAGTCATGGT

CTGCTAGGTTGCAGGCTGTAGTTTACAAAAACGAATATTGCAGTGAATATGTGATTCTTTAA 

GGCTGCAATACAAGCATTCAGTTCCCTGTTTCAATAAGAGTCAATCCACATTTACAAAGATG 

CATTTTTTTCTTTTTTGATAAAAAAGCAAATAATATTGCCTTCAGATTATTTCTTCAAAATA 

TAACACATATCTAGATTTTTCTGCTTGCATGATATTCAGGTTTCAGGAATGAGCCTTGTAAT 

ATAACTGGCTGTGCAGCTCTGCTTCTCTTTCCTGTAAGTTCAGCATGGGTGTGCCTTCATAC 

AATAATATTTTTCTCTTTGTCTCCAACTAATATAAAATGTTTTGCTAAATCTTACAATTTGA 

AAGTAAAAATAAACCAGAGTGATCAAGTTAAACCATACACTATCTCTAAGTAACGAAGGAGC

TATTGGACTGTAAAAATCTCTTCCTGCACTGACAATGGGGTTTGAGAATTTTGCCCCACACT

AACTCAGTTCTTGTGATGAGAGACAATTTAATAACAGTATAGTAAATATACCATATGATTTC 

TTTAGTTGTAGCTAAATGTTAGATCCACCGTGGGAAATCATTCCCTTTAAAATGACAGCACA 

GTCCACTC^AAGGATTGCCTAGCAATACAGCATCTTTTCCTTTCACTAGTCCAAGCCAAAAA 

TTTTAAGATGATTTGTCAGAAAGGGCACAAAGTCCTATCACCTAATATTACAAGAGTTGGTA

AGCGCTCATCATTAATTTTATTTTGTGGCAGGTATTATGACAGTCGACCTGGAGGGTATGGA 

TATGGATATGGACGTTCCAGAGACTATAATGGCAGAAACCAGGGTGGTTATGACCGCTACTC

AGGAGGAAATTACAGAGACAATTATGACAACTGAAATGAGACATGCACATAATATAGATACA 

CAAGGAATAATTTCTGATCCAGGATCGTCCTTCCAAATGGCTGTATTTATAAAGGTTTTTGG 

AGCTGCACTGAAGCATCTTATTTTATAGTATATCAACCTTTTGTTTTTAAATTGACCTGCCA 

AGGTAGCTGAAGACCTTTTAGACAGTTCCATCTTTTTTTTTAAATTTTTTCTGCCTATTTAA 

AGACAAATTATGGGACGTTTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MMLLVQGACCSNQWLAAVLLSLCCLLPSCLPAGQSVDFPWAAVDNMMVRKGDTAVLRCYLED 
GASKGAWLNRSSIIFAGGDKWSVDPRVSISTLNKRDYSLQIQNVDVTDDGPYTCSVQTQHTP
RTMQVHLTVQVPPKIYDISNDMTVNEGTNVTLTCLATGKPEPSISWRHISPSAKPFENGQYL
DIYGITRDQAGEYECSAENAVSFPDVRKVKVVVNFAPTIQEIKSGTVTPGRSGLIRCEGAGV
PPPAFEWYKGEKKLFNGQQGIIIQNFSTRSILTVTNVTQEHFGNYTCVAANKLGTTNASLPL
NPPSTAQYGITGSADVLFSCWYLVLTLSSFTSIFYLKNAILQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain:

amino acids 326-345 

N-glycosylation sites.

amino acids 71-75, 153-157, 273-277, 284-288, 292-296, 305-309 

Casein kinase II phosphorylation site. 

amino acids 147-151, 208-212, 224-228 

Tyrosine kinase phosphorylation site. 

amino acids 178-186 

N-myristoylation sites. 

amino acids 7-13, 63-70, 67-73, 151-157, 239-245, 291-297,
302-308, 319-325



Myelin P0 protein: 

amino acids 92-121 
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AGTGGTTCGATGGGAAGGATCTTTCTCCAAGTGGTTCCTCTTGAGGGGAGCATTTCTGCTGG 

CTCCAGGACTTTGGCCATCTATAAAGCTTGGCAATGAGAAATAAGAAAATTCTCAAGGAGGA 

CGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAA 

TCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATC 

TACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGC 

GCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 

CCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTG 

CAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACTTGCTGCAGCGGGT 

AGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAACAAGGCGCCCCAGGTC 

TTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCCCGCCGGGACCACCTGCT 

GAGAAGGGAGCCAAGGGGGCTATGGGACGAGATGGAGCAACAGGCCCCTCGGGACCCCAAGG 

CCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAG 

GAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTCATT 

GGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGAAGCAA 

AGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAGGGGAGTA 

AAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGAT 

CAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGC 

CAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGA 

GTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGACACAGGACTTCAAGGACAGCAAGGA 

AGAAAAGGAGAATCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGG 

GCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAG 

GATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCC 

GTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGG 

GACAATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTT 

ACTCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 

GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCA 

TGACTGCAGCCACGAGGAGGACGCAGGCGTGGAGTGCAGCGTC2G&CCCGGAAACCCTTTCA 

CTTCTCTGCTCCCGAGGTGTCCTCGGGCTCATATGTGGGAAGGCAGAGGATCTCTGAGGAGT 

TCCCTGGGGACAACTGAGCAGCCTCTGGAGAGGGGCCATTAATAAAGCTCAACATCATTGA 
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</usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA68886
<subunit 1 of 1, 520 aa, 1 stop
<MW: 52658, pI: 9.16, NX(S/T): 3

AEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLW
SCTKNSWGHHDCSHEEDAGVECSV 

Transmembrane domain: 

amino acids 47-66 (type II) 

N-glycosylation sites. 

amino acids 43-47, 83-87, 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 432-440 

N-myristoylation sites. 

amino acids 41-47, 178-184, 253-259, 274-280, 340-346, 346-352, 
400-406, 441-447, 475-481, 490-496, 515-521 

Amidation site. 

amino acids 360-364 

Leucine zipper pattern. 

amino acids 56-78 

Speract receptor repeat 

amino acids 422-471, 488-519 



Clq domain proteins. 

amino acids 151-184, 301-334, 316-349 
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CCCACGCGTCCGAAGGCAGACAAAGGTTCATTTGTAAAGAAGCTCCTTCCAGCACCTCCTCT 
CTTCTCCTTTTGCCCAAACTCACCCAGTGAGTGTGAGCATTTAAGAAGCATCCTCTGCCAAG 

ACCAAAAGGAAAGAAGAAAAAGGGCCAAAAGCCAAAATGAAACTGATGGTACTTGTTTTCAC
CATTGGGCTAACTTTGCTGCTAGGAGTTCAAGCCATGCCTGCAAATCGCCTCTCTTGCTACA 
GAAAGATACTAAAAGATCACAACTGTCACAACCTTCCGGAAGGAGTAGCTGACCTGACACAG 
ATTGATGTCAATGTCCAGGATCATTTCTGGGATGGGAAGGGATGTGAGATGATCTGTTACTG 
CAACTTCAGCGAATTGCTCTGCTGCCCAAAAGACGTTTTCTTTGGACCAAAGATCTCTTTCG 
TGATTCCTTGCAACAATCAATGAGAATCTTCATGTATTCTGGAGAACACCATTCCTGATTTC 
CCACAAACTGCACTACATCAGTATAACTGCATTTCTAGTTTCTATATAGTGCAATAGAGCAT 
AGATTCTATAAATTCTTACTTGTCTAAGACAAGTAAATCTGTGTTAAACAAGTAGTAATAAA 

AGTTAATTCAATCTAAAAAAAAAAAAA 
</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNAS2758

<subunit 1 of 1, 98 aa, 1 stop
<MW: 11081, pI: 6.68, NX(S/T): 1

MKLMVLVFTIGLTLLLGVQAMPANRLSCYRKILKDHNCHNLPEGVADLTQIDVNVQDHFWDG
KGCEMICYCNFSELLCCPKDVFFGPKISFVIPCNNQ

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-76 

Tyrosine kinase phosphorylation site. 

amino acids 63-71 
CCCACGCGTCCGCGGACGCGTGGGCTGGACCCCAGGTCTGGAGCGAATTCCAGCCTGCAGGG
CTGATAAGCGAGGCATTAGTGAGATTGAGAGAGACTTTACCCCGCCGTGGTGGTTGGAGGGC 

GCGCAGTAGAGCAGCAGCACAGGCGCGGGTCCCGGGAGGCCGGCTCTGCTCGCGCCGAGATG
TGGAATCTCCTTCACGAAACCGACTCGGCTGTGGCCACCGCGCGCCGCCCGCGCTGGCTGTG 
CGCTGGGGCGCTGGTGCTGGCGGGTGGCTTCTTTCTCCTCGGCTTCCTCTTCGGGTGGTTTA 
TAAAATCCTCCAATGAAGCTACTAACATTACTCCAAAGCATAATATGAAAGCATTTTTGGAT
GAATTGAAAGCTGAGAACATCAAGAAGTTCTTACATAATTTTACACAGATACCACATTTAGC 
AGGAACAGAACAAAACTTTCAGCTTGCAAAGCAAATTCAATCCCAGTGGAAAGAATTTGGCC 
TGGATTCTGTTGAGCTAGCTCATTATGATGTCCTGTTGTCCTACCCAAATAAGACTCATCCC
AACTACATCTCAATAATTAATGAAGATGGAAATGAGATTTTCAACACATCATTATTTGAACC 
ACCTCCTCCAGGATATGAAAATGTTTCGGATATTGTACCACCTTTCAGTGCTTTCTCTCCTC 
AAGGAATGCCAGAGGGCGATCTAGTGTATGTTAACTATGCACGAACTGAAGACTTCTTTAAA 
TTGGAACGGGACATGAAAATCAATTGCTCTGGGAAAATTGTAATTGCCAGATATGGGAAAGT 
TTTCAGAGGAAATAAGGTTAAAAATGCCCAGCTGGCAGGGGCCAAAGGAGTCATTCTCTACT
CCGACCCTGCTGACTACTTTGCTCCTGGGGTGAAGTCCTATCCAGACGGTTGGAATCTTCCT 
GGAGGTGGTGTCCAGCGTGGAAATATCCTAAATCTGAATGGTGCAGGAGACCCTCTCACACC 
AGGTTACCCAGCAAATGAATATGCTTATAGGCGTGGAATTGCAGAGGCTGTTGGTCTTCCAA 

GTATTCCTGTTCATCCAATTGGATACTATGATGCACAGAAGCTCCTAGAAAAAATGGGTGGC

TCAGCACCACCAGATAGCAGCTGGAGAGGAAGTCTCAAAGTGCCCTACAATGTTGGACCTGG 
CTTTACTGGAAACTTTTCTACACAAAAAGTCAAGATGCACATCCACTCTACCAATGAAGTGA 
CGAGAATTTACAATGTGATAGGTACTCTCAGAGGAGCAGTGGAACCAGACAGATATGTCATT
CTGGGAGGTCACCGGGACTCATGGGTGTTTGGTGGTATTGACCCTCAGAGTGGAGCAGCTGT
TGTTCATGAAATTGTGAGGAGCTTTGGAACACTGAAAAAGGAAGGGTGGAGACCTAGAAGAA 
CAATTTTGTTTGCAAGCTGGGATGCAGAAGAATTTGGTCTTCTTGGTTCTACTGAGTGGGCA 
GAGGAGAATTCAAGACTCCTTCAAGAGCGTGGCGTGGCTTATATTAATGCTGACTCATCTAT 
AGAAGGAAACTACACTCTGAGAGTTGATTGTACACCGCTGATGTACAGCTTGGTACACAACC 
TAACAAAAGAGCTGAAAAGCCCTGATGAAGGCTTTGAAGGCAAATCTCTTTATGAAAGTTGG
ACTAAAAAAAGTCCTTCCCCAGAGTTCAGTGGCATGCCCAGGATAAGCAAATTGGGATCTGG 
AAATGATTTTGAGGTGTTCTTCCAACGACTTGGAATTGCTTCAGGCAGAGCACGGTATACTA 
AAAATTGGGAAACAAACAAATTCAGCGGCTATCCACTGTATCACAGTGTCTATGAAACATAT
GAGTTGGTGGAAAAGTTTTATGATCCAATGTTTAAATATCACCTCACTGTGGCCCAGGTTCG 
AGGAGGGATGGTGTTTGAGCTAGCCAATTCCATAGTGCTCCCTTTTGATTGTCGAGATTATG
CTGTAGTTTTAAGAAAGTATGCTGACAAAATCTACAGTATTTCTATGAAACATCCACAGGAA
ATGAAGACATACAGTGTATCATTTGATTCACTTTTTTCTGCAGTAAAGAATTTTACAGAAAT
TGCTTCCAAGTTCAGTGAGAGACTCCAGGACTTTGACAAAAGCAACCCAATAGTATTAAGAA
TGATGAATGATCAACTCATGTTTCTGGAAAGAGCATTTATTGATCCATTAGGGTTACCAGAC 
AGGCCTTTTTATAGGCATGTCATCTATGCTCCAAGCAGCCACAACAAGTATGCAGGGGAGTC 
ATTCCCAGGAATTTATGATGCTCTGTTTGATATTGAAAGCAAAGTGGACCCTTCCAAGGCCT
GGGGAGAAGTGAAGAGACAGATTTATGTTGCAGCCTTCACAGTGCAGGCAGCTGCAGAGACT 
TTGAGTGAAGTAGCCTAAGAGGATTTTTTAGAGAATCCGTATTGAATTTGTGTGGTATGTCA
CTCAGAAAGAATCGTAATGGGTATATTGATAAATTTTAAAATTGGTATATTTGAAATAAAGT 

TGAATATTATATATAA 
</usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA52756
<subunit 1 of 1, 750 aa, 1 stop
<MW: 84305, pI: 6.93, NX(S/T): 10

MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIKSSNEATNITPKHNMKAFL
DELKAENIKKFLHNFTQIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLLSYPNKTH
PNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEGDLVYVNYARTEDFF 
KLERDMKINCSGKIVIARYGKVFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNL 
PGGGVQRGNILNLNGAGDPLTPGYPANEYAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMG
GSAPPDSSWRGSLKVPYNVGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYV
ILGGHRDSWVFGGIDPQSGAAVVHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEW 
AEENSRLLQERGVAYINADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYES 
WTKKSPSPEFSGMPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYET 
YELVEKFYDPMFKYHLTVAQVRGGMVFELANSIVLPFDCRDYAVVLRKYADKIYSISMKHPQ 
EMKTYSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP
DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQAAAE
TLSEVA 

Signal sequence:

amino acids 1-40

N-glycosylation sites. 

amino acids 76-80, 121-125, 140-144, 153-157, 195-199, 336-340, 
459-463, 476-480, 638-642 

Tyrosine kinase phosphorylation sites.

amino acids 363-372, 605-613, 606-613, 617-626 



N-myristoylation sites.

amino acids 85-91, 168-174, 252-258, 256-262, 282-288, 335-341, 
360-366, 427-433, 529-535, 707-713 



